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Excitation of Parabolic-Index Fibers 
With Incoherent Sources 


By D. MARCUSE 
(Manuscript received May 5, 1975) 


We study the excitation of a parabolic-index fiber by an incoherent 
source. The theory vs based on approximating the fiber modes by Laguerre- 
gaussian functions. The dependence of the total light power rnjected into 
the fiber core on the separation between source and fiber, and on the trans- 
verse displacement of the source, 1s shown in graphic form. Also shown are 
far-field radiation patterns, which indicate the distribution of power versus 
mode number, for several launching conditions and plots of power versus 
azimuthal mode number for given values of the compound mode number. 
The study of launching efficiency versus source radius leads to prescriptions 
for optimtzing the ratio of source to fiber core radius without a matching 
lens. Use of a lens for matching the image of a small source to the size of 
the jiber core increases the launching efficiency relative to the power con- 
sumption of the light-emitting-source diode. 


I. INTRODUCTION 


The excitation of step-index fibers (fibers whose cores have a con- 
stant index of refraction surrounded by a lower index cladding) has 
been investigated by several authors by means of ray optics.!? In this 
paper, we study the excitation of fibers with a core whose refractive 
indices have a square-law dependence on the radial dimension (para- 
bolic-index fibers). The fibers are assumed to support many modes and 
are excited by an incoherent source—for example, a light-emitting 
diode (LED). Our analysis is based on wave optics. We assume that we 
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may use the Laguerre-gaussian modes of the infinite square-law 
medium to approximate the modes of the parabolic-index fiber. Lower- 
order modes do not carry significant amounts of power in the region of 
the core boundary so that they are approximated very well by the 
Laguerre-gaussian modes of the infinite medium. We introduce the 
effect of the core boundary by considering the Laguerre-gaussian 
modes as being cut off when their propagation constants become equal 
to the plane wave propagation constant of the cladding material. 

The Laguerre-gaussian modes have the advantage that they not 
only approximate the modes of the parabolic-index fiber, but they are 
also approximate solutions of beam waves in free space. The free-space 
modes have a beam width parameter that is a function of the z co- 
ordinate (distance from the fiber measured in the direction of its axis). 
At the fiber end, the free-space Laguerre-gaussian modes match the 
fiber modes. Thus, it is only necessary to calculate the excitation of the 
free-space Laguerre-gaussian beam modes by the incoherent source, 
if we assume that the space surrounding the fiber is matched to the 
fiber core (at least approximately) by means of an index-matching 
fluid. 

We study the excitation of the modes of the parabolic-index fiber as 
functions of the source radius, its distance from the fiber, and as a 
function of its transverse displacement. This study provides informa- 
tion about the tolerance requirements for aligning the source with 
respect to the fiber. We also discuss the optimization of the source 
diameter with regard to the total power delivered to the fiber and with 
regard to the excitation efficiency relative to the electrical power re- 
quired to drive the source. 

We conclude that the tolerance requirements for placing the source 
are modest and that either the total amount of power or the power 
excitation efficiency of the fiber can be optimized by a suitable choice 
of the source diameter. It is assumed that the fiber, as well as the 
source, have circular cross sections. 

Equation (29) states a simple law for the total amount of light power 
that may be injected into a parabolic-index fiber by an incoherent 
source of brightness B [A is defined by eq. (18) ]. The normalization 
used for the power plotted in Figs. 4 through 10 is based on the expres- 
sion for the total number of guided modes (28). 

The efficiency of the system could be improved considerably by use 
of a lens or taper to match the light output of a small-area incoherent 
source to the core of the fiber. Use of lenses or tapers may be unde- 
sirable because such matching devices introduce added complexity 
into the system. However, if high overall efficiency is an important 
requirement, matching of the output of a small-area LED to the fiber 
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core with the help of an additional optical system offers a means of 
increasing the launching efficiency. 


Il. EXCITATION OF MODES BY AN INCOHERENT SOURCE 


We consider a complete orthogonal set of modes obeying the ortho- 
normality condition? 


: [ (E, X H*)-e,drdy = Pb,,. (1) 


The symbols E, and H, indicate the electric and magnetic field 
vectors of the modes labeled » or yu, e, is a unit vector in z direction, A 
is the infinite cross-sectional area of the structure, and 6,, 1s the I<ro- 
necker delta symbol. The factor P is a normalizing constant with the 
dimension of power. It is assumed to be the same for all the modes. 

The total electric field can be expressed as the superposition of all 
the modes, 


E = > o£, (2) 
y=] 
The total power carried by the field may be expressed as‘ 


Py = ¥ P(e," (3) 


It can be shown that a current with current density j excites each 
mode according to the formula,’ 


i oe 
6=- = [ iBar. (4) 


The integral is extended over the volume in which the current density 
j exists; P is the normalizing parameter encountered in (1) and (8). 

We are now ready to apply this formalism to the excitation of modes 
by an incoherent source. As a model of an incoherent source, we con- 
sider a disc of circular cross section with radius 6 and thickness f. 
The current density inside of the disc is assumed to be a random func- 
tion with vanishing correlation length. The ensemble average of the 
absolute square magnitude of (4) is 


i , 
ae — ocr B®. aq). 
(lel?) = zap f dV f av'E?-i*)-E. (5) 


The quantity (jj*) appearing in (5) is a tensor of second rank. How- 
ever, we consider that the current in the source is composed of many 
randomly oriented and randomly phased dipoles. This assumption 
allows us to assume that the off-diagonal elements of the tensor 
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vanish, and that all diagonal elements are equal. Thus, the tensor re- 
duces to a multiple of the unit tensor J, and we may write 


(jj*) = SIé(r — 1’). (6) 
Using (6) reduces (5) to the simpler form 
LS * 
Oy : 

(lel) = Ts [= Eddy. (7) 


A, is the area of the circular cross section of the source disc. 

If we apply the formula (7) to the plane-wave modes of free space, 
we can calculate the amount of power AP that is flowing through the 
element of solid angle dQ in a given direction in space. This calculation 
results in the expression 
wot 
pee 167? 





AsnkdQ, (8) 


where 
k = wVeouo 


n = refractive index of the medium. 


The derivation of (8) is sketched in the appendix. 

Our model of an incoherent source behaves like a Lambert-law 
radiator except for a missing factor cos 6, where @ is the angle between 
the direction of observation and the normal direction to the surface of 
the disc. The factor cos 6 is missing in our theory because we treated 
the source as being transparent to radiation. In a partially opaque 
source, radiation leaving the source must originate in a volume with 
an effective thickness ¢ as seen in the direction of observation. It is 
shown in Fig. 1 that the effective thickness of the source disc depends 





aloe fe 


Fig. 1—Schematic of a partially opaque source disc. The effective width for waves 
emitted at a normal angle to the surface is ¢; for waves emitted at an angle 6, the 
effective width is ¢ cos 6. 
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on the direction of observation so that we must replace ¢ with ¢ cos @. 
Modified for the case of a partially opaque source, eq. (8) assumes 


the form 
AP = BA, cos 6dQ. (9) 


The brightness of the source is defined as 


wpoStnk . 


ee Sages 


(10) 





Equation (10) relates the product St to the measurable quantity B. 
If we introduce the z component 8 of the propagation constant nk by 
the equation 

8 = nk cos @ (11) 


and modify (7) for the case of a partially opaque source, we may write 


P(|e|?) = 





= 7 apn l, E*-E,dxdy. (12) 


The left-hand side of this expression indicates the power carried by the 
mode labeled v. This power is provided by the incoherent source with 
brightness B. The brightness is the amount of power that the unit area 
of the source radiates into the unit solid angle, its dimension is W/cm? 
sr (sr = steradians). 

Equation (12) is the starting point for our discussion of the excitation 
of the modes of the parabolic-index fiber by an incoherent source. 


lil. EXCITATION OF LAGUERRE-GAUSSIAN MODES 


The Laguerre-gaussian beam modes of free space, normalized to 
carry the power P, may be expressed as follows :*? 


Dr+3 Ho p! 3 
e,rwen Veg (p + v)! 


Wo (= y e~ (rw)? TO) (=) cos vpge”, (13) 
w? 


W W 


z= | 


with 


= fi for v=0O (14) 


1 for »#¥0O 


The beamwidth parameter is defined by the formula (k = w(e€ouo)?, n iS 
the refractive index of “free space’’) 


Q2z \?]3 
W = Wo E -+- (4) ) (15) 
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and the phase function is given as 


22 ) nkr? (16) 


y= —nke + (+ 2p +1) aretan (=, ~ OR’ 


with the phase front radius of curvature 


nkws \? 

r= 2[1+( 2)’. (17) 
E,,» 1s the x or y component of the electric field vector of the Laguerre- 
gaussian beam mode. Thus, the electric field is linearly polarized but 
is expressed in a cylindrical coordinate system with the coordinates r, 
@, and z. We may replace the function cos v@ by sin v@ without chang- 
ing any other parameter in (13). Thus, the modes are degenerate in the 
sense that two orthogonal polarizations, as well as both choices of the ¢ 
symmetry (sin v¢ or cos v¢@), are allowed for each set of mode numbers 

v, p. The function L{” is a Laguerre polynomial.® 
The origin of the z coordinate is at the narrowest point of the field 
distribution where we have w = wo. The minimum beam width wy is 
arbitrary. However, the Laguerre-gaussian beam modes are equal to 
the modes of the square-law medium,’ with refractive index dis- 


tribution 
y 2 
n= No E = (<) a), (18) 


if we set w = Wo and use? 


This choice of the beamwidth parameter wo ensures that, at z = 0, the 
transverse field distribution of the Laguerre-gaussian mode of free 
space coincides with the mode of the square-law medium (18). Both 
types of modes, the beam modes of free space and the modes of the 
square-law medium, are only approximate solutions that apply in the 
paraxial approximation that holds for small values of the refractive 
index parameter A and, for free space modes, for modes with sufficiently 
‘large values of wo. For values of z other than z = 0, the phase function 
of the modes of the square-law medium must be expressed as 


y= —fz (20) 


with the propagation constant 








3 
B= Ea = net V2A(2p + v + | . (91) 


If we assume that free space consists of a medium whose refractive 
index n is matched to an average value of the fiber core, reflection from 
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the fiber end is negligibly small. We assume that the fiber core has 
radius a and that its refractive index is given by (18) in the region 
O <r <a. In the cladding at r > a, the refractive index assumes the 


constant value 
ne = no(l — A). (22) 


The mode fields (13) [with w = wo of (19) ] approximate the modes 
of the parabolic-index fiber at radius r<a for small values of 
vy and p. For large values of the mode numbers » and p, the fields extend 
strongly beyond r = aso that (18) (with w = wo) is no longer a good 
approximation to the fiber modes. However, modes reaching strongly 
into the cladding are no longer guided by the fiber core. For this 
reason, we regard (13) (with w = wo) as an approximation for all 
guided fiber modes and consider the relation 


B = nok = no(1 — A)k (23) 


as a cutoff condition for the guided modes. 

Because the modes of the parabolic-index fiber join smoothly with 
the Laguerre-gaussian beam modes of free space, we obtain the excita- 
tion of the fiber modes by determining the excitation coefficients for 
the Laguerre-gaussian beam modes with the help of (12). Substitution 
of (18) into (12) results in 

Sar B p '2” 


PC 6526 |) = e,w2(nk)? (p + v)! 


2y 2 2 
x / (<) e-2(r/w)? jue (=) rdrdé. (24) 
As \W W 


The ratio G/k was approximated by unity. We have added the coeffi- 
cients for the modes with cos v¢ to those of the sin v@ symmetry and 
used the fact that the sum of the squares of cosine and sine is unity. 
The integration in (24) extends over the surface of the source As. 

If the surface of the source is larger than the area over which the 
mode field exists with an appreciable amplitude, the integral can be 
performed with the result 

2 
P(|e,,»|?) = = 


e, (nk)? (25) 


For v = 0, there is only one type of mode because sin vé = 0. For all 
other values of v, we have lumped two modes together. If we express 
again only the power of a single mode of a given polarization and 
azimuthal symmetry, we have, instead of (25), 


Plas) =2(2) B=5(%) B. (26) 


This is an interesting formula. First of all, it shows that each mode (of 
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the fiber or of the free-space beam modes) receives an equal amount of 
power if the incoherent source is large enough. Second, comparison of 
eq. (26) with (9) shows that each mode acts as though it receives radia- 
tion from a square of the source surface whose sides are equal to the 
wavelength and as if it collects all power radiated into the solid angle 
5 sr. It is now easy to determine the power that is collected by all the 
guided modes of the fiber. We need only multiply (26) by the number 
of modes. The total number of guided modes 1s obtained from (21) and 
the cutoff condition (28). Combining these two equations, we obtain the 
following equation for the boundary 1n mode number space: 


2 
Op +9 + Dawe = nha sie = (2) (27) 


Figure 2 shows the mode number space defined by the two variables 
y and p. The diagonal line (the hypotenuse of the triangle) is defined 
by (27). The guided modes lie inside the triangle shown in the figure. 
The total number of modes is approximately equal to four times the 
area of this triangle. The factor 4 stems from the fact that for each set 
of values of v and p, we have modes with two different polarizations 
and two different azimuthal symmetries (except for vy = 0). Thus, the 
total number of modes is 


N = (nka)? 2. (28) 


The total amount of power P; injected by a large, incoherent source 
into a square-law fiber is given as the product of (26) and (28), 


P,; = r(ra?)BA. (29) 


Equation (24) can be used to calculate the power in each mode for 
arbitrary position of the source in relation to the fiber. In general, we 
assume that the fiber is separated from the source by a distance z and 
that it is offset by an amount d as shown in Fig. 3. The ® integration in 


Vv 


p 


Fig. 2—Mode number plane p, v. The diagonal line is the boundary of the guided 
modes that are contained inside the triangular region. 
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SOURCE 


= 
ab ae 
| 

= a ¢ 


P - FIBER CORE 
—$— 7 a 


Fig. 3—Geometry of the source exciting a fiber core. 


(24) can still be performed with the result 


4nBo op! 
(nk)*w* (p + »)! 


|b—d| 2\p» D2 2 
«oe (BY coe [oe (BP 
0 W W 
oa ar’ oie)? | 7.0) = | Pie 
—+- ie (ar = 201) (=) é E & rdr ; (30) 


r+ d— =). 


P(\cv,p| )= = 


with 


(31) 


P; = 
1 aresin ( Ord 


Equation (30) applies again for one mode of given polarization and 
azimuthal symmetry. The integral in (30) must be evaluated numeri- 
cally. The z dependence of the expression is hidden in the beamwidth 
parameter w according to (15). 


IV. NUMERICAL EVALUATION AND RESULTS 


In this section we show the results of numerical evaluations of eq. 
(30). We begin by discussing the dependence of the total power in- 
jected into the fiber on the distance between the source and the fiber 
end. All length variables are normalized with respect to the fiber radius 
a. Figure 4 shows the dependence of the normalized total power on 
z/a for a source whose radius is equal to the fiber radius, b/a = 1, for 
three values of A. The normalization of the power is apparent by com- 
parison with (26). Since P; indicates the total power carried by all the 
modes, we have divided it by the total number of modes N that is 
obtained from (28). This normalization results in unit normalized 
power at z = 0. 

Figure 4 shows that, for A = 0.01, the amount of power that is in- 
jected into the fiber by the heahierent source drops to approximately 
one-half of its maximum value for z/a=11. Thus, the injection process 
is surprisingly insensitive to the separations between the source and 
the fiber end. We have checked that the curves of Fig. 4 do not change 
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Fig. 4—Normalized total power Py; injected into the parabolic-index fiber as a 
function of the distance z between source and fiber. N is the total number of modes 
[see (28)]; B is the source brightness. This figure holds for a source-to-fiber core 
radius ratio b/a = 1. 


if the value of ka is varied. Thus, the curves are universal, at least for 
large values of ka, that is, for fibers supporting a large number of modes. 
This dependence on k suggests that the curves may be derived by the 
methods of ray optics. The lowest part of the curves has almost reached 
an inverse square-law dependence on fiber-source separation. We have 
checked that a precise inverse square-law dependence is obtained for 
source-to-fiber ratios of b/a = 0.1. Thus, the inverse square law is 
reached rather slowly and will be realized by the curves of Fig. 4 for 
even larger values of z/a than those appearing in the figure. The curve 
with A = 0.01 shows that a separation of the source from the fiber end 
equal to the fiber diameter, z = 2a, causes a drop in power coupling 
efficiency by only approximately 5 percent. The tolerance to source- 
fiber separation eases for smaller values of A. The curves do not reach 
exactly the value unity at z = 0 because the source is not infinitely 
wide. However, the slight departure from unity (0.3 percent for . 
N = 400, A = 0.01) is not apparent on the scale of the figure. 

Figure 5 shows the dependence of the excitation efficiency on the 
amount of relative offset d/a for a source whose radius equals the fiber 
radius, b/a = 1, and which is located at three different distances from 
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the fiber end. The curves were computed for A = 0.01. However, the 
shapes of the curves are universal and only their vertical positions 
depend on the value of A. It is immediately apparent from this figure 
that the relative tolerance to source displacements (offsets) becomes 
more liberal as the distance between the source and the fiber end is 
increased. A source displacement of d/a = 1 causes a drop in power to 
40 percent of its maximum value if the source is placed directly at the 
fiber end, z/a = 0. For z/a = 10, the excitation efficiency has dropped 
from 0.55 to 0.34, that is, only 62 percent of its maximum value, fora 
transverse source displacement of d/a = 1. Like Fig. 4, Fig. 5 is in- 
dependent of ka if this value is large. 

So far we have concentrated on the total power injected into the 
fiber. It is interesting to consider the distribution of power over the 
various guided modes as the distance between the source and the fiber 
is increased. The far-field pattern emerging from the far end of the 
fiber (which is not facing the source) gives an indication of the mode 
distribution. Figure 6 shows the power density of the far-field radiation 
pattern as a function of angle a. The curves of this figure represent far- 
field radiation patterns for several values of the relative fiber-source 
distance z/a for A = 0.01. The curves were obtained by taking the 





0.01 0.02 0.04 0.06 0.1 0.2 0.4 06 0.81 2 4 6 8 10 
d/a 


Fig. 5—Normalized total power injected into parabolic-index fiber as a function 


of the transverse source displacement d for several values of the distance z of the 
source from the fiber and for A = 0.01. 
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Fig. 6—Far-field radiation pattern. a is the angle (in degrees) of the direction of 

the observation point with respect to the fiber axis. The curves are arbitrarily nor- 

malized to unity ata = 0. The source-to-fiber core radius ratio is b/a = 1, A = 0.01. 


0.8 


0.6 


RELATIVE POWER 


ensemble average of the absolute square value of (2), 
N 
(LEI) = ¥ (lel IEE (32) 


with the mode fields of (13) and the expansion coefficients of (30). All 
curves in Fig. 6 were computed for a source whose radius is equal to the 
fiber radius, b/a = 1, with no transverse source displacement, d/a = 0. 
The far-field pattern of a parabolic-index fiber with all modes equally 
excited corresponds to the curve labeled z/a = 0. As the source is 
moved away from the fiber end, the far-field radiation pattern narrows. 
This narrowing is caused by the fact that higher-order modes receive 
less power as the distance z is increased. The curves of Fig. 6 are nor- 
malized so that the power density at zero angles becomes unity. 

Figures 7 through 9 give more detailed insight into the distribution 
of power versus mode number. Equation (21) shows that modes with 
equal values of the compound mode number, 


M=2p+7+1, (33) 


have equal propagation constants. These modes lie on straight lines 
parallel to the diagonal line in mode-number space shown in Fig. 2. 
Figure 7 indicates the power distribution among the modes with 
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M = 19 that lie near the guided-mode boundary in mode-number 
space. If the source is placed directly in contact with the fiber, z/a = 0, 
all modes with M = 19 are almost equally excited, receiving almost the 
maximum of power. However, for z/a = 4, at a point where the total 
amount of power has dropped by only 8 percent from its maximum 
value, the highest-order mode group with M = 19 suffers a very sub- 
stantial decrease in power. It is interesting that modes with higher 
values of v receive more power (for constant values of M). This trend 
is reversed only for z/a 2 8. 

Figure 8 shows the relative power in other mode groups at a fixed 
value of z/a = 8. Modes with low compound mode number, M S 6, 
are still fully excited, but with increasing values of MW the amount of 
power in the higher-order modes drops off. Figure 9 shows the same 
trend even more strongly for a source-fiber separation of z/a = 20. 

So far we have studied the dependence of the excitation efficiency 
of the parabolic-index fiber on the source-fiber separation z/a and on 
the amount of offset d/a for b/a = 1. Figure 10 shows the normalized 
total amount of power for z/a = 0 as a function of the relative source 
radius b/a. This curve does not depend on the values of A or ka. As 
expected, the total amount of injected power drops off as b/a decreases. 
However, the decrease in total power is not proportional to the area of 
the source, as one might have expected, but is nearly proportional to 





Fig. 7—Normalized power in a given mode belonging to the mode group with com- 
pound mode number 2p + » + 1 = 19 as a function of the azimuthal mode number 
v for several values of the distance z between source and fiber. These curves apply to 
the case b/a = 1, ka = 188.5, A = 0.01, n = 1.457. 
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n= 1.457 
ka = 188.5 
A=0.01 


Poop 


(n Ko)2 
2778 








Fig. 8—Normalized power of individual modes belonging to the compound mode 
number 2p + » + 1 (whose values are given in the figure) as a function of the 
azimuthal mode number v. The source-to-fiber distance is z/a = 8, all other parame- 
ters are the same as in Fig. 7. 


the source radius. This behavior has interesting consequences for the 
optimum choice of the source radius as is discussed in the next section. 

Figure 11 shows far-field radiation patterns for several values of 
b/a. Comparison with Fig. 6 shows that the dependence of the far-field 


Pf ff ot k a= 188.5 


A= 0.01 





Fig. 9—This figure is similar to Fig. 8 with z/a = 20. 


1520 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 


ERA 


0.8 





0.6 


Pe 


29r2 BN 





(nk,)? 


0.4 


0.2 


0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 
b/a 


Fig. 10—Normalized total power as a function of the normalized source radius 6. 
This figure is independent of ka and A. 
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Fig. 11—Far-field radiation pattern from the end of the fiber for several values of 
the source-to-fiber core radius ratio b/a. 
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radiation patterns on the source radius bears a close resemblance to 
its dependence on the source-fiber separation. Higher-order modes are 
excited less strongly as the source radius decreases. 


V. SOURCE OPTIMIZATION WITHOUT LENSES 


If the source brightness B is held constant, more power is injected 
into the fiber as b/a increases to a value near unity. Beyond that value, 
no further advantage is to be gained. In fact, even though the total 
amount of injected power remains constant for b/a > 1, the overall 
efficiency decreases since regions of the source at b > a do not con- 
tribute to the excitation of the fiber, but do waste their power. If 
brightness were independent of the dimensions of the source, the 
optimum source radius would be b = a. However, light-emitting diodes 
(LEDs) tend to be brighter if their radius decreases. C. A. Burrus has 
made measurements on a special type of LED operated at two-thirds of 
its saturation current that indicate that source brightness increases 
with decreasing radius. This dependence is shown in Fig. 12, which 
was drawn from data given in Burrus’ paper. The solid line is drawn 
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_ Fig. 12—Source brightness of high-intensity LeDs according to Ref. 10. The dotted 
line is an inverse 2b law used to approximate the solid curve. 
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through the average values, and the vertical error bars indicate the 
experimental uncertainty of the brightness measurements. The dotted 
line shown in Fig. 12 is the hyperbola 

950 WwW 


~ 3b ~ G/a) — 


— 


that seems to approximate the experimental data reasonably well. To 
a rough approximation, the brightness of LEDs (at least of the Burrus 
type) seems to be inversely proportional to their radius. 

To study the excitation efficiency as a function of source radius, we 
approximate the curve of Fig. 10 by the polynomial 


(nk)? P; _ b b \? b \3 : 
with 
A, = —0.06875 
A» = 2.85 (36) 
A3 = —1.78125 


This third-order polynomial approximates the curve in Fig. 10 to 
within 1 percent. Substitution of (84) into (35) results in 
(nk)? Py; 
27? WN 





b b \? an 
= Ae ae eel | (37) 

a a 
This function is plotted in Fig. 18. Under the conditions prevailing in 
Burrus-type diodes, where the maximum attainable brightness de- 
pends on the source radius, the total power that can be injected by an 
LED in direct contact with a parabolic-index fiber has a maximum at a 
source radius of 6 = 0.8a. However, even for source radii as small as 





Fig. 13—Total power injected into the fiber with the high-brightness LEDs of Fig. 
12 as a function of the (normalized) source radius b. 
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b = 0.6a, the total power in the fiber equals the amount that is ob- 
tained with an LED whose radius equals the fiber radius. This means 
that an LED with a radius only approximately half as large as that of 
the fiber core is still almost as effective as an LED whose radius equals 
the fiber core radius. 

Disregarding the electrical input power into the LED, we would 
optimize the overall performance of the fiber system, operated without 
a matching lens, by choosing a source-to-fiber radius ratio of b/a = 0.8. 
However, a different optimum is obtained if we try to optimize the 
ratio of total power injected into the fiber to the power required to 
drive the diode. The power input to the LEp can be estimated from the 
information contained in Burrus’ paper” by multiplying the diode 
current with the energy gap voltage, V = 1.38 V at room temperature. 
This power estimate comes close to the actual power since the voltage 
developed across the LED’s terminals varies between 1.35 and 1.6 V. 
Four points obtained for the LED’s power consumption operated at 
two-thirds the saturation current are shown in Fig. 14 as a function of 
the diameter 2b of the diode. In the region between 26 = 0 and 2b = 50 
um, the power curve is approximately linear. According to the limited 
information that is available, the curve seems to turn over for larger 
values of 2b. However, since only one point (at 2b = 100 um) does not 
lie on the straight line, the shape of the curve beyond 2b = 50 um is 
not known. For sufficiently small source radii, we approximate the 
curve in Fig. 14 by the equation 


P. = (8.5)(10-%) (2b) = W.b/a, (38) 


keeping in mind that this linear law becomes questionable for 2b > 50 
pm. Substitution of (388) into (37) yields 


(nk)? We Py _ As 5 
ie NW Po ble oY og 





(39) 


This function is shown in Fig. 15. The maximum of the fiber excitation 
efficiency relative to the electrical drive power of the diode appears at 
b/a = 0.2, that is, at rather small source radii. It is important to re- 
member that, even though Fig. 15 is drawn as a function of b/a, only 6b 
is allowed to vary while a must be kept constant because W., appearing 
in the normalization coefficient, is a function of a. 

A good compromise between the maximum achievable total power 
and the desire to obtain good excitation efficiency relative to the power 
input to the LED may be to operate with a diode whose radius is ap- 
proximately one-half of the fiber radius. In this case, b/a = 0.5, we lose 
17 percent of the optimum operating power efficiency and work 19 
percent below the maximum achievable injected power. But neither 
loss of efficiency is very serious and both requirements, low diode power 
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Fig. 14—-Power consumption of the high-brightness LEDs as a function of source 
radius. 


consumption and a large amount of total power launched into the fiber, 
are still approximately satisfied. 


VI. OPERATION WITH A MATCHING LENS 


Figure 10 shows that the amount of power launched into the para- 
bolic-index fiber decreases with decreasing fiber core radius if the 
source brightness is held constant. The reason for this decrease in 1n- 





Fig. 15—Normalized ratio of total power injected into the fiber, P,, relative to the 
electrical drive power P of the diode as a function of the (normalized) source radius. 
The solid curve applies for an LED in direct contact with the fiber; the dotted curve 
describes operation with.a matching lens. 
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jected power is the loss of total source area. However, by bringing the 
source in direct contact with the fiber, a large amount of power is lost, 
because the fiber can trap only rays emitted at certain small maximum 
angles whose values depend on the point at which the ray is entering 
the fiber core. If we remove the source from the end of the fiber and 
focus its light onto the fiber end with a matching lens, we may increase 
the source image to make it coincide with the fiber core radius, but at 
the same time we inject all rays at a smaller angle. The loss in source 
brightness, caused by the magnification of its image, is thus compen- 
sated for by the fact that many of the rays, those that left the diode at 
angles too large to be trapped when the source was in direct contact 
with the fiber, are now transformed to smaller angles so that a wider 
cone of light leaving the source is able to be accepted by the fiber. 

To investigate the beneficial effect of a matching lens, we use the 
transformation laws of Laguerre-gaussian beams that have been 
formulated by IKogelnik.! We assume that the source is imaged by a 
lens onto the end of the fiber as shown in Fig. 16. The transformation 
laws of gaussian beams?! yield the result (in agreement with geo- 
metrical optics) that the beamwidth parameter wo of the fiber mode is 
transformed to the width w given by 


w= 2, Wo (40) 


The source radius b has been transformed by the lens to the radius of 
its image b’; wo is given by (19). We can evaluate the effect of the 
matching lens by writing (30) (with d = 0) in the form 


2B op! 


Peal) = TRE OM! 


2b2/w2 
J we[ L® (u)Pdu. (41) 
0 


Equation (41) expresses the amount of power in one mode. The total 
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Fig. 16—Launching geometry with a matching lens. 
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power injected into the fiber is obtained by summing over all the 
guided modes. The beamwidth parameter w of the transformed 
Laguerre-gaussian beam enters this expression only in the upper limit 
of the integral. The critical term to consider is thus 


bb” 


Equation (41) is of exactly the same form as the expression (30) for 
d = 0 in the absence of the lens. Only the beamwidth w at the position 
of the source is important. Choosing the lens in such a way that the 
image size b’ of the source becomes equal to the fiber core radius 
(b’ = a) results in wy = 2a?/wo. This means that the launching eff- 
clency through the matching lens is identical to the launching efficiency 
for a source with b = ain direct contact with the fiber, even though the 
size of the source in Fig. 16 is smaller than the fiber radius. In fact, we 
know from Fig. 10 that the excitation efficiency 1s not changed even if 
the source is made larger than the fiber core. The same is true for the 
image of the source projected onto the fiber end through the matching 
lens. Even if we project a source image that is larger than the fiber end, 
the launching efficiency is maintained if the source brightness (of the 
original, not the imaged source) is held constant. This seemingly con- 
tradictory result is caused by the fact that more and more rays arrive 
at the fiber end at smaller angles with respect to the fiber axis, making 
up the loss in image brightness that is caused by the larger lens mag- 
nification. This constancy of the launching efficiency, which is achieva- 
ble with a matching lens, breaks down only when the lens must be so 
far removed from the fiber that some of the light power, which would 
radiate from the fiber (if we reverse the direction of all the light beams), 
begins to miss the lens boundary. 

Thus, we have reached the conclusion that a matching lens can 1m- 
prove the launching efficiency of a small source to such a degree that 
we can obtain as much power as would be available from a large source 
in contact with the fiber. The total coupling efficiency with respect to 
the electrical power used by the diode may be considerably improved 
with the help of a matching lens. Instead of (89), a matching lens 
allows us to achieve the power ratio 


P,_ NW 1 
P. (nk)?W. (b/a)? 


Equation (48) is plotted as the dotted curve in Fig. 15. This expression 
does not contain the effect of the finite lens aperture that must be con- 
sidered for very large magnification. Equation (43) indicates that a 
matching lens would allow us to achieve higher launching efficiencies 


(43) 
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than may be achieved by placing the LED directly in contact with the 
fiber. 

Instead of a lens, a tapered dielectric waveguide of high refractive 
index difference between core and cladding material may be used to 
match a small LED to a larger fiber core. However, any optical matching 
device complicates the basically simple configuration of an LED in direct 
contact with the fiber. For many applications, it may be more ad- 
vantageous to suffer the additional coupling loss that results by for- 
going the procedure of matching the source to the fiber size. In- 
cidentally, a matching lens does not increase the amount of power that 
may be injected from a large source. 


Vil. CONCLUSIONS 


We have studied the excitation of parabolic-index fibers with in- 
coherent light sources and found that a source, whose area covers the 
cross section of the fiber core, injects equal power into all the modes. 
As the source is moved away from the fiber end, it injects relatively 
less power (without a lens) into higher-order (as compared to low- 
order) modes. However, the injection mechanism is quite tolerant of 
source-fiber separation. At a distance of five fiber-core diameters (as- 
suming b/a = 1 and A = 0.01), the total amount of power injected 
into the fiber core decreases only to one-half the amount that can be 
achieved if the source is placed directly in contact with the diode. If 
the source is transversely displaced, the amount of power launched into 
the fiber drops to about one-half of its maximum value for a source 
displacement equal to the fiber radius, if the source is in contact with 
the fiber. 

Without a matching lens, the amount of power launched into the 
fiber decreases with decreasing source radius (if b/a < 1). However, 
since the brightness achievable from an LED increases with decreasing 
radius, an optimum radius for maximum light-power injection into the 
fiber is obtained at a source-to-core radius ratio of b/a = 0.8. Relative 
to the electrical power requirements of the diode, the launching effi- 
ciency is optimized at b/a = 0.2. A compromise between these two 
optima may be to choose the ratio b/a = 0.5. These numbers are 
based on a special high-brightness LEp developed by Burrus.” 

Use of a matching lens allows us to inject the same amount of power 
from a small LED that would be available from a large source of equal 
brightness in direct contact with the fiber. However, because the 
achievable brightness increases with the decreasing radius of an LED 
more power can be obtained from a small LED whose light is focused 
into the fiber with a lens. Use of a matching lens also increases the 
overall efficiency of operation as shown by the dotted line in Fig. 15. 
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However, matching lenses or tapers complicate the basically simple 
launching geometry of an LED in direct contact with the fiber. 
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APPENDIX 


We sketch the derivation of eq. (8). The plane-wave modes of free 
space can be expressed as” 


(2wpoP)? 
= —t(xz+oy+B8z) 
Bg one ee e (44) 
with 
B? = nk? — x? — o?. (45) 


From (7) we obtain, by substitution of (44), 





PC) = 35q33 StAs. (46) 


The total power radiated by the source is obtained from the formula :* 
P,=2 J i ” P(\c|?)dede. (47) 


The factor 2 accounts for the two possible polarizations. We may ex- 
press the differentials of the integral in terms of the element of solid 
angle dQ into which the radiation is directed, 


dxdo = nkBdQ. (48) 


Thus, the fractional amount of power radiated into the element of 
solid angle is 
AP, = 2P(\c|?)nkgdaQ. (49) 


Substitution of (46) into (49) results in (8). 
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Lossy plastic coatings are used as a means of providing mechanical 
protection for optical fibers during the optical-cable manufacturing process. 
A model utilizing a quasi-ray analysis has been developed in this paper to 
determine the effects of lossy coatings on the transmission energy in a 
multimoded step-index optical fiber. Cladding thickness 1s the dominant 
fiber parameter that plays a critical role in preventing transmission 
loss due to a lossy coating. Other parameters that significantly affect trans- 
mission loss are transmitting wavelength, the real and imaginary part of 
the refractive index of the lossy coating, and the fiber core diameter. 


I. INTRODUCTION 


A thin lossy plastic coating applied to individual optical fibers is 
being considered as a means of decreasing crosstalk! between the 
optical fibers and as a mechanism for protecting the fibers during the 
cable-manufacturing process. The effect of the lossy coating on the 
transmission energy in a fiber is the subject of this paper.?? 

A quasi-ray tracing approach is used to describe energy propagation 
in a multimoded step-index optical fiber with a lossy plastic coating.*° 

An integral expression for the power transmitted in the fiber is 
developed in terms of the geometry of the round fiber, the intrinsic loss 
of the fiber core, the reflection coefficient at the core-cladding interface, 
and the energy distribution at the launching end of the fiber. To 
calculate the reflection coefficient at the core-cladding boundary, the 
strategy followed is to replace the round fiber by a lossy multilayered 
semi-infinite slab model. A computer evaluation of the integral expres- 
sions for the transmitted and input power has been made. Included in 
this paper are the results of a study showing the functional relationship 
between power loss due to a lossy coating and cladding thickness, 
mode energy distribution, numerical aperture, and wave length. 
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Fig. 1—Optical fiber with lossy coating. 


ll. DERIVATION OF TRANSMISSION AND REFLECTION LOSS FORMULAS 


For the configuration shown in Fig. 1, we follow a technique de- 
veloped in an earlier paper! and present an expression for the trans- 
mission of energy in the core of a multimode optical fiber whose clad- 
ding is surrounded by a lossy material. We assume that an optical source 
focuses its power on the center of the entrance end of a fiber, exciting 
meridional rays as shown in Fig. 2. We also assumed that: 


(2) The input angular power distribution of the fiber is a gaussian 
function of the form 


BO) dee Wie, (1) 


where x is a parameter that is a measure of the width of the 
input beam and also an indication of how the power is distrib- 
uted among the modes of the fiber, 6, is the critical angle of the 
fiber, and F,, is a constant amplitude. 


(21) The propagating modes within the fiber are uncoupled, with the 
absorption coefficient a equal for all modes. 


Under these assumptions, the fiber input and output powers are: 


A : 
ue | | ee 
V. ? 


Fig. 2—Meridional ray fiber excitation. 
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@max 
Pisa = Oe: J sin Be- 6l180)*dg (2) 
0 


and 


8max 


Phones / sin Ge~ Ole)? BM (9) eal see Og, (3) 
0 
where 


L sis the fiber length, 

M is the total number of bounces a ray makes while propagating 
down the fiber and is the largest integer smaller than 

L 


1 tana +5 


d , (4) 
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Fig. 3—-Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.10, d = 25.4 um. 
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Fig. 4—Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.15, d = 25.4 um. 
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d 1s the fiber core diameter, 

a is an absorption coefficient per unit length that takes into ac- 
count both the fiber bulk absorption loss and scattering loss, 

Omax 18S the maximum input angle corresponding to the critical angle 
within the fiber, and 

R (6) is the reflection coefficient at the core-cladding boundary. 


To calculate &(6), the strategy followed is to replace the round fiber 
by a lossy multilayered semi-infinite slab model. The derivation of 
(6) for the slab model is shown in the appendix. The refractive indices 
and, in turn, the impedances of the media are complex to account for 
the lossy coating. 
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Fig. 5—Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.20, d = 25.4 um. 
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Ill. SUMMARY OF RESULTS OF THE COMPUTER STUDY 


A computer program was written and the integrals (2) and (3) were 
evaluated for typical fiber parameters.* A number of studies were done 
to determine how transmission loss due to the coating varies as a 
function of cladding thickness, wavelength, core diameter, and the real 
and imaginary part of the coating refractive index. Figures 3 through 5 
show, respectively, for fiber numerical apertures of 0.10, 0.15, and 0.20, 
the relationship between transmission loss due to the lossy coating and 
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Fig. 6—Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.10, d = 50.8 um. 
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Fig. 7—Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.15, d = 50.8 um. 
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cladding thickness for a 25.4-um fiber core diameter. For the same 
numerical apertures, Figs. 6 through 8 and Figs. 9 through 11 show 
this relationship for 50.8- and 75.2-um core diameters. 

For practical cladding thickness greater than 15 um, increasing the 
cladding thickness will decrease the transmission loss due to the lossy 
coatings by approximately 0.04 dB/km per micrometer of cladding 
thickness for the higher-order modes (x = 10.0). For the lower-order 
modes (x S 0.5), a cladding thickness of 15 um should provide suffi- 
cient isolation to prevent transmission loss due to the presence of the 
lossy coating. 

Calculations were made to determine the relationships between 
transmission loss due to a lossy coating and wavelength (A), core 
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Fig. 8—Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.20, d = 50.8 um. 
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Fig. 9—-Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.10, d = 76.2 um. 


diameter (d), and the real and imaginary pasts of the refractive index 
of the coating. A thin cladding of 8 um was chosen in these calculations 
to easily illustrate the trends due to these parameters. This thin 
cladding was not intended to be a practical choice for a cladding thick- 
ness in an optical fiber. Figure 12 shows the transmission loss due to a 
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Fig. 10—Transmission loss due to coating vs d/C and vs cladding thickness; 
NA = 0.15, d = 76.2 um. 


lossy coating of 4 dB/um on a fiber whose numerical aperture was 
0.15 and core diameter 50.8 um. As expected, for a fixed cladding 
thickness, the transmission loss will increase as the wavelength in- 
creases. The increase in loss for the parameters chosen was, for the 
longer wavelengths (>0.8 um), approximately 1.5 dB/km per microme- 
ter of wavelength. Figure 13 shows the weak dependence of trans- 
mission loss on core diameter. Figures 14 and 15 show respectively the 
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Fig. 11—Transmission loss due to coating vs @/C and vs cladding thickness; 
NA = 0.20, d = 76.2 um. 
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Fig. 12—Transmission loss due to coating vs wavelength. 
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Fig. 13—-Transmission loss due to coating vs core diameter. 
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Fig. 14—Transmission loss due to coating vs real part of refractive index of coating. 
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Fig. 15—Transmission loss due to coating vs imaginary part of refractive index of 
coating. 
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transmission loss as a function of the real and imaginary parts of the 
refractive index of the coating. Using Fig. 14, note that for a fiber with 
a cladding refractive index of 1.458 and coating loss of 4 dB/yum, the 
transmission loss increases, as the real part of the refractive index of 
the coating increases, up to a maximum at a value of the refractive 
index of approximately 1.51. As one increases the refractive index of 
the coating beyond 1.51, the transmission loss levels off to a constant 
value. 

In Fig. 15, for a fixed real part of the refractive index of the coating 
equal to 1.40, the transmission loss was observed as a function of the 
coating loss (the imaginary part of the refractive index of the coating). 
The transmission loss increases rapidly with coating loss and reaches 
80 percent of its final value for a coating loss of approximately 1 dB/um. 
Further increase in the loss of the coating does not substantially affect 
the fiber transmission loss. 


IV. CONCLUSIONS 


The model described, along with an experimentally determined 
knowledge of the energy distribution (value of x), can be used to choose 
fiber parameters that will prevent transmission loss caused by the lossy 
coating. Because of the meridional-ray assumption made in the analy- 
sis, a conservative estimate of the transmission loss is predicted by the 
model. The dominant fiber parameter that plays a critical role in pre- 
venting the transmission loss is cladding thickness. A cladding thick- 
ness of at least 20 um is necessary to provide adequate isolation. The 
model also calculates the significant effect on transmission loss of 
transmitting wavelength, the real and imaginary part of the refractive 
index of the lossy coating, and the fiber core diameter. 
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APPENDIX 


Calculation of Input Impedance and Reflection Coefficient for a Multilayered 
Dielectric Medium 

In this appendix, the elementary concepts of input impedance and 
reflection coefficient are developed for a multilayered dielectric medium. 
Consider the geometry shown in Fig. 16. 

Let us suppose that between two semi-infinite media, denoted by 1 
and n + 1, there are n — 1 layers of dielectric material denoted by 
2, 3, ++, n. Let a plane wave be incident on the last layer at an angle 
of incidence @,41 and let the plane of incidence be the X — Z plane. 
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Fig. 16—Geometry used for the calculation of the reflection coefficients in a lossy 
multilayered dielectric medium. 


As a result of multiple reflections at the boundaries of the layers, two 
waves will exist in each of the media with the exception of medium 1. 
Our problem is to determine the amplitude of the reflected wave in 
medium n + 1 and, hence, the reflection coefficient. 


The following notation is used: 


d; = 2; — 2-1 = the thickness of the jth medium, 
k; = “ n; = the wave number in jth medium, 
a; = k; cos 0; = 2 component of the wave vector in the jth medium, 


oj; = ajd; = the phase change in the jth medium, 


Z; = the self impedance of the jth medium, 
7, = the input impedance looking into the jth medium from the 
7 + 1 medium. 
The electric and magnetic fields in the jth medium are written as: 
Ey = A;exp [—ia,;(2 — 2;-1)] + B; exp [a;(@ — 251), (5) 
1 
Hjz = 7 {A; exp [—ta;(z — 2;-1)] — Bj exp [ta;(2 — 2;-1)]}. (©) 


4; 


The x and ¢t dependency in this case is omitted for the sake of brevity 
but assumes the general form 


exp [2(Kn4i% SIN Ony1 — wt) |. 


A; and B; are the amplitudes of the incident and reflected waves in the 
jth medium (B; = 0). The amplitude An: of the incident wave is 
assumed to be known. To obtain the reflection coefficient of interest: 


| Rees = Bayi/Ant (7) 
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One can utilize formula (84) of Ref. 1 to express the input impedance 
looking into the nth medium from the n + Ist medium: 
ml) _ 77, tan dn 


Zp =F 


Z, — ize tan, 2” (8) 


The reflection coefficient of the incident wave can then be written in 
terms of input impedances as follows: 


Baar = LE —_ Zn. 
A nai LZ + Ln4t 


In eq. (3), the power reflection coefficient FP is used. In terms of I, & is 
defined as 


T= (9) 





R= irle = Feel 


10 
re (10) 





For the examples in Section III, a four-layered medium composed of 
the fiber core, cladding, lossy coating, and surrounding air was used 
when calculating the reflection coefficient and the transmission loss due 
to the lossy coating. The refractive index of the lossy coating was 
defined as 

N = Neoating — 16, (11) 
where 6 is the imaginary or lossy part of the refractive index of the 


coating. 


The relationship between 6 and the operationally useful term, coating 
opacity, is* 
5 = 29.9 d (opacity), (12) 


where 


opacity = the transmission loss of the coating in dB/um 
\ = wave length of the transmitted light in micrometers. 


The term opacity is introduced here since it is an easily measurable 
indicator of the loss of the coating and a convenient input variable to 
the computer program. 
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A Fiber-Optic-Cable Connector 


By C. M. MILLER 
(Manuscript received May 30, 1975) 


A technique has been developed that is potentially suitable for field- 
splicing an optical cable containing linear arrays of optical fibers. Linear 
arrays of fibers (which may reside in fiber ribbons) are placed between 
spacers that are grooved top and bottom to form stacked, rectangular arrays. 
This operation can be done without microscopes or micromantpulators. 
After potting, the ends of the two stacked arrays are polished to form cable 
terminations that are brought together in a butt joint splice. A 12 X 12 
array using this technique exhibited a mean loss of 0.42 dB for 188 splices 
with 70 percent of the losses less than 0.5 dB. Subsequent single ribbon-to- 
ribbon splices had average losses less than 0.2 dB. Launching conditions 
can be duplicated and splice losses are repeatably low for reassembled 
splices; this presumably 7s due to polished fiber ends and accurate align- 
ment. Laperience gained thus far indicates that this mass splicing method 
wil probably produce large array splices with a maximum loss of 0.5 dB. 


I. INTRODUCTION 


It is believed that splicing groups of optical fibers in the field will 
be necessary in fiber communication systems. Several investigators 
have successfully spliced individual fibers with various techniques.!~4 
Others have addressed themselves to splicing linear arrays of fibers.° 

While some aspects of linear array or fiber-ribbon splicing appear 
applicable to cable splicing in the field, the operations must be per- 
formed for each linear array in a fiber-optic cable. Good ends must be 
obtained for each fiber and, although techniques for accomplishing this 
are evolving,® still these represent additional operations required for 
each fiber or fiber group during cable splicing. Another potential 
problem in applying individual or small-group splicing to a fiber-optic 
cable is in reassembly of the spliced ribbons into a compact spliced 
cable. The individual connectors would have to be very thin, and the 
splices would have to be the same length to effect compact reassembly. 

The potential problems mentioned above stimulated investigation 
of another approach, cable splicing, by which we mean splicing all 
fibers of two cables by joining connector halves (terminations) formed 
on each cable end. 
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Il. FIBER-OPTIC-CABLE SPLICING 


As presently conceived, cable splicing involves the following 
operations: 


(2) Aligning all fibers of one end of a fiber-optic cable into a uni- 
form matrix. 
(22) Potting the structure to retain the geometry. 
(227) Grinding and polishing the ends of the potted array. 
(zv) Joining two cable ends prepared by the previous three 
operations. 


Kach of these processes 1s covered in more detail in the following 
sections. An alternative approach is also presented which uses fiber 
ends prepared by controlled breaking. *® 


Ill. ALIGNMENT OF FIBERS 


Several techniques have been attempted to align fibers in a linear 
array. Threading fibers through holes as opposed to laying fibers in 
grooves is in general a more difficult and less accurate method of fiber 
alignment. The ‘‘grooved”’ concept was adopted for this cable con- 
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Fig. 1—Stacking fiber ribbons. 


1548 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 










——e 





Ee = — A 
~ 
ori 








12 GROOVES 
7 
/ 
/ 


“ 
12 GROOVES 


40 X VIEW 4X VIEW 





nector. Since a compact splice was desired, a thin chip design was 
selected which was grooved on both sides. 

Figure 1 is a magnified view of how grooved chips are used to obtain 
fiber alignment. Ribbon ends are prepared by removing the supporting 
material as shown in Fig. 1. The process consists of interleaving chips 
and layers of fibers until all linear arrays have been stacked. 

Precision grooved chips provide the primary alignment mechanism 
for assembly of fibers into a uniform rectangular array. These chips 
were produced by the Bulova Watch Company. Figure 2 is a sketch of 
the specifications for the chip and a photograph of the cross section of 
the chip. The machine work performed to make these chips was 
excellent. Since the chips are grooved on both sides, the top and bottom 
chip of the stacked array have unoccupied grooves which can be used 
as references for alignment during polishing and subsequently to align 
the two rectangular arrays in forming the butt splice. 


IV. POTTING THE ARRAY 


A stacked array, held by a vise, can be potted by allowing epoxy 
to seep through the array. Tests with close-packed arrays of fibers 
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showed that epoxy would seep at least $ inch down the length of the 
array before curing stops all flow. This sets the maximum length of the 
grooved chip; however, based on our experience a 32-inch length ap- 
peared adequate. Approximately 15 minutes is required for epoxy to 
seep through a 23-inch stacked array. Faster curing of the epoxy occurs 
at elevated temperatures; however, at room temperature several hours 
are required for the epoxy to completely cure. 

A vise is used to hold the stacked array while epoxy is applied. 
Chips with ridges that mate with the unoccupied grooves of the 
stacked array will be referred to as negative chips. These negative 
chips are used to align the top and bottom chips of the stacked array 
while in the vise. Figure 3 shows the vise and the negative chips 
attached to the vise faces. 


V. POLISHING THE ARRAY 


To obtain low loss in a splice, the fiber end must be made flat and 
perpendicular to the fiber axis. This end preparation is usually ac- 
complished by either controlled breaking,*® sawing, or polishing. Sawing 
the epoxied arrays may yield suitable ends for splicing. Controlled 
breaking is not applicable to the epoxied array previously described, 
although polishing methods can be easily applied. Since only one 
polishing operation is required for each cable splice, regardless of the 
number of fibers or ribbons, this technique 1s probably the least time- 
consuming and the cheapest. A holding fixture for supporting and 
gradually advancing the epoxied array during the polishing sequence 
is shown in Fig. 4 with two connectors in place after polishing. The 
indicator at the rear of the fixture moves the inner core of the fixture 
(and the sample) relative to the outer cylinder. This fixture used with 
a grit sequence of 220, 800X, and 0.3 uw has produced high-quality 
ends in approximately 15 minutes. Figure 5 shows a connector after 
polishing. 


VI. FINAL ALIGNMENT 


Several final alignment methods have been used to meet two differ- 
ent needs in this area. First, an alignment method has been developed 
for use in the laboratory in making splices that can be measured and 
then disassembled. Plexiglass and steel fixtures have been used for 
this purpose in conjunction with a grooved chip negative. These 
negative chips are pressed against the top and bottom of the two con- 
nectors and then placed in the final alignment fixture which further 
aligns the two connectors and holds them in place. This arrangement 
is shown in the top part of Fig. 6. 
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Fig. 3b—Detail showing negative chips. 
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Fig. 4—Polishing fixture. 


A second alignment method is used to obtain a more compact, 
permanent splice. For this case, the negative chips are epoxied directly 
onto the grooved chip connectors, again while being held and aligned 





Fig. 5—End view of 12 X 12 connector after potting and polishing. 
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Fig. 6—Top: final alignment fixture; middle: epoxied splice; bottom: single 
connector. 


in a suitable fixture. Figure 7 is a photograph of such a splice with 
index-matching epoxy used to complete splice fabrication. 


VII. SPLICE-LOSS MEASUREMENTS 


Using the methods and fixtures described, a 12 X 12 array splice 
was fabricated and measured. Six fibers were damaged or broken during 
the fabrication of the four connectors (connectors were placed on the 
input and output of the ribbon stack in addition to the splice). As 
shown in Fig. 8, the mean loss was 0.42 dB with a maximum loss of 
1.3 dB for 188 good splices. Seventy percent of the losses were less than 
0.5 dB. 

Single ribbon splices have been fabricated and measured and yield 
lower losses than array splices. For a typical ribbon splice with index- 
matching, the maximum loss was below 0.5 dB and the average loss 
below 0.2 dB. Crosstalk coupling between adjacent splices in a typical 
ribbon splice is less than —65 dB. 


Vill. PERMANENT CABLE SPLICE 


The fiber-optic-cable connector described thus far consists of pre- 
pared cable terminations which are joined to complete the cable splice. 
If a final alignment fixture is used to hold the cable terminations in 
place, as opposed to epoxy, the splice could be disassembled. An 
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Fig. 7—Splice assembly. 


alternate approach’ can be used which, although still based on the 
grooved-chip concept, uses a different operational sequence and no 
polishing and produces a splice which cannot be disassembled. This 
approach is illustrated in Fig. 9. Here, the fiber ends will probably be 
prepared by one of the methods being developed by D. Gloge® et al. 
The stacked array is fabricated in much the same way except that two 
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Fig. 8—Histogram of splice losses. 
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Fig. 9—Single layer of a permanent cable splice. 


sets of previously prepared ends are placed in each layer. The array is 
then epoxied to form a permanent splice. This technique will be more 
difficult to assemble, and end preparation will probably be done on a 
ribbon-by-ribbon basis using controlled breaking. However, this 
permanent assembly technique should produce compact splices with 
losses approaching those of ribbon splices. 

Some difficulties associated with ribbon splicing mentioned earlier 
are present in this approach. These include multiple operations required 
for end preparations and nearly exact ribbon length requirements. The 
resulting splice will, however, be small and strong, and should easily 
meet the goal of a large-array splice with maximum loss below 0.5 dB. 
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Step-Size Transmitting Differential Coders 
for Mobile Telephony 


By N. S. JAYANT 
(Manuscript received February 7, 1975) 


In using digital speech for mobile radio, we encounter the problem of 
severe bit-error bursts. Error clustering occurs because the bit duration 1s 
typically much smaller than that of a signal ‘‘fade,”’ and average bit-error 
probabilities greater than 1 percent are not uncommon. For speech com- 
munication over such channels, this paper proposes variable step-size 
differential coders based on explicit (and error-protected) transmission of 
quantizer step size. Specifically, we discuss delta and DPCM coders to be 
referred to aS DM-AQF and DPCM-AQF, where AQF stands for adaptive 
quantization with forward estimation (and transmission) of step size. 
(Backward estimation, based on quantized-signal history, has the nice 
feature that the step-size information does not have to be explicitly trans- 
mitted. Furthermore, obtaining this information does not entail any en- 
coding delay. However, due to the dependence of step size on reconstructed 
signal history, backward estimation 1s often less reliable in the presence 
of bit errors than a scheme based on AQF.) The studies reported in this 
paper cover the problem of step-size determination in AQF, the design of 
teme-invariant first-order predictors for DPCM-AQF, and the performances 
of AQF encoders with and without burst-error-protecting ploys such as 
redundant tume-diversity coding and bit scrambling. Judging from SNR 
figures and informal listening tests, interesting results are obtained with 
the following 48-kb/s coders: three-bit DPCM-AQF with redundant error 
protection, and DM-AQF using bit scrambling. 


I. INTRODUCTION 


Recent developments in speech digitization! have prompted an 
examination of digital coding as a possibility for mobile radio telephony 
that conventionally employs analog techniques for speech transmis- 
sion. Conceivably, much of the signaling supervision and “book- 
keeping” in a mobile radio link can be digital; in this case, if the speech 
were handled digitally as well, it would be simple to interleave the 
voice bits with the control bits for transmission. Digital coding also 
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offers the possibilities of inexpensive coder-decoder implementation, 
straightforward speech encryption (by bit scrambling), and efficient 
signal regeneration. Perhaps the greatest incentive for the use of digital 
speech, however, is the thought that a properly designed digital code 
may be more resistant than analog systems to the multipath fading 
that characterizes mobile radio. 

Figure 1 shows the envelope of a Rayleigh-fading signal that is 
typical in mobile telephony.? An important parameter is the fading 
rate, which is approximately the ratio of vehicle speed V to the carrier 
wavelength \. For the example in Fig. 1, this ratio is about 15 Hz. 
Note also that the 5 m represent a total travel time of about 1s at the 
indicated vehicle speed, and that the fading is slow or correlated in the 
sense that a given fade (signal strength below a specified threshold) 
can last for several tens of milliseconds (which will represent several 
hundred speech bits for the codes of this paper). The probability of 
a fade can be decreased by an order of magnitude by the use of diversity 
reception (two-branch, equal-gain or switched diversity, for example). 
But when a fade does occur, the signal is susceptible to noise capture 
as well as to co-channel interference. The end effects, with conventional 
analog transmissions, are impulsive ‘‘pops’ and ‘“‘crackles’’ in the 
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Fig. 1—Envelope of a Rayleigh fading signal (V/A = 15.4 Hz). 


1558 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 


received speech. Without explicit signal companding, these effects can 
be severe under “‘idle-channel”’ conditions, when no speech is present 
on the channel of interest. By contrast, if ‘‘adaptive” digital modula- 
tors are used to transmit speech, it is expected that the variable step- 
size mechanism in these modulators would inherently attenuate the 
impulsive interference signals (manifested as clustered bit errors in the 
digital system) during the silences of the incoming speech. This inter- 
ference-squelching property is, in fact, expected to carry over to the 
“active-channel”’ condition with an ideally designed adaptive code 
(that exploits the statistics of signal fading or, equivalently, the time 
statistics of the bit-error bursts). Even if such an ideal design should 
be impractical, it is clear that digital coding can straightforwardly 
employ efficient burst-error-protection ploys such as time-diversity 
coding and bit-scrambling. However, these refinements (as well as the 
notion of forward estimation of step size) can involve significant 
amounts of encoding delay. 

In our search for a suitable digital coder, we have used the following 
characteristics as guidelines. The speech bandwidth should be repre- 
sentative of standard telephone quality (200 to 3200 Hz); average bit- 
error rates higher than 1 percent are possible at times; and, finally, the 
overall transmission rate should not exceed a nominal 48 kb/s. When 
we refer to a ‘“‘nominal 48-kb/s rate,”’ we mean that additional channel 
capacity (in the order of 2 to 5 kb/s) may be needed for the trans- 
mission of step-size information. 

A basic contention of this paper is that the ‘“‘optimum”’ step size for 
a speech quantizer changes slowly enough with time for the step-size 
information to be transmitted reliably in a special error-protected 
format over a typical mobile radio channel. Thus, although the main 
stream of speech-carrying bits is still subject to errors, the provision of 
a relatively error-free step size will improve the received speech quality 
to a point that makes explicit step-size transmission worthwhile. We 
show that step-size transmitting coders are of interest for bursty as 
well as independent error patterns, and we include a comparison with 
a popular error-resistant syllabic-companded quantizer that recovers 
step size from the bit stream. Iollowing Noll,’ step-size transmitting 
adaptive coders will be labelled aqr (adaptive quantization with 
forward estimation and transmission of step size), in contradistinction 
to AQB (adaptive quantization with backward estimation). 

The coders of this paper are differential. We discuss both pPcm 
(differential Pom) and pm (delta modulation). It appears from ex- 
perience! that conventional time-invariant log-pcm quantization does 
not meet the error-performance requirements of mobile telephony. 
However, the possibility of a well-designed adaptive Pcm!:** definitely 
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exists. A promising candidate is the technique of nearly instantaneous 
companding (nIc).! 

Although our studies have included informal perceptual assessments, 
most performance results in this paper are objective signal-to-noise 
ratios termed SNRT and SNRR. These reflect, respectively, the speech 
quality at the output of the local and remote decoders (T and rR stand 
for ‘‘transmitter’’ and ‘‘receiver’’). Formal definitions appear in Iig. 2. 
As.we shall note at appropriate points in the paper, an SNRT-maximizing 
encoder does not, in general, maximize SNRR, and vice versa. 

Our discussions refer to computer simulations that employed band- 
limited (200 to 3200 Hz) speech utterances (2 s or, sometimes, longer 
in duration) and bit-error patterns obtained from fading simulators.°® 
We believe that the main conclusions of this paper should hold for 
broad classes of speech and error patterns encountered in a mobile 
radio environment. However, our numerical results are often reflective 
of the specific data used in our computer simulation. To demonstrate 
real-world variabilities of these numerical results, we have employed 
variable speech data, whenever appropriate. 

Section II of this paper illustrates the time characteristics of the 
simulated burst error channel. Section III discusses the design of a 
DM-AQF coder. The section also demonstrates that simple bit-protecting 
codes are not particularly beneficial with pM-aaFr (except for the trans- 
mission of step-size information). Bit scrambling, on the other hand, 
provides a definite advantage. Suitable sampling rates for DM-aQF are 
shown to be in the order of 30 to 40 kHz. Finally, a performance com- 
parison is made between DM-AQF and a representative DM-AQB code. 
Section IV describes the design of a ppcM-aQ@F coder, and demonstrates 
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Fig. 2—Block diagram of codec simulation. 
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the utility of time-diversity coding for bit protection. It also indicates 
that a three-bit coder operating redundantly at a nominal 48 kb/s 
(and an 8-kHz sampling rate) is a better choice that a four-bit coder 
operating (also redundantly, but with less bit protection) at the same 
information rate. Section V provides a comparison of DPCM-AQF and 
DM-AQF. 


ll. BIT-ERROR PATTERNS 


2.1 Burst errors 


Two simulated-error sequences were used in this study, representing 
average error probabilities of 0.025 and 0.055. These numbers represent 
channel qualities believed to be typically ‘‘much worse than average.’’® 
The durations of the error sequences were long enough to simulate the 
transmission of all but the longest of the speech utterances being en- 
coded. For this utterance (which was 9 s long), the bit-error sequences 
were used repeatedly to cover the total speech duration. Simulated 
bit rates ranged from 24 to 48 kb/s. 

Figure 3 displays typical distribution functions for error-burst dura- 
tion D and the error-free interval J. The numbers refer to a subsegment 
of the 0.025 error rate sequence. The error rate is denoted by P(EB), 


Oo D [BURST DURATION] 
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Fig. 3—Time statistics of error bursts [P(#) = 0.025, V/A = 36.2 Hz]. 
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where B refers to burst errors. An error burst in Fig. 3 is defined to 
have a local error probability of 1. In other words, an error burst of 
length D, implies that Dy contiguous signal bits are in error. An isolated 
error, for example, is an error burst of length D,) = 1. The upper rows 
of Table I lists the average and median values of burst duration D 
and error-free interval I for the subsequence examined. The ratio of 
(Daverage) tO (Daverace + Taverage) gives the average bit-error probability 
for the subsequence (1.6/51.6 = 0.03). The ratio of (Dmedian) to 
(Dymedian + Linedian), by contrast, is as high as 0.23. Of particular interest 
is the fact that Iaverage >> Imedian. This signifies the presence of some 
error-free intervals that are extremely long, together with a preponder- 
ance of intervals that are (unfortunately) very short (in fact, not much 
longer than Davyerage). The clustered nature of the errors is somewhat 
more apparent by comparison with average and median statistics that 
apply to an appropriate random error channel: that is, a channel where 
errors occur independently at every sample, but with an average error 
probability that is the same as that of the burst-error channel. The 
lower rows of Table I shows those statistics, as calculated for a random 
error channel whose bit-error probability is 0.03. Note that the value of 
Daverage 8 Much higher (for the same average error probability) in the 
case of the bursty channel, as expected. 

Burst-error patterns, including that of Table I, were obtained from 
a fading simulator. The main components of the simulation were a 
pseudorandom binary input, an FM transmitter-receiver, a Rayleigh 
fader that took into account desired ratios of vehicle speed to carrier 
wavelength, a noise generator, a pseudorandomly modulated carrier. 
to approximate the effect of co-channel interference, and the option of 
switched-diversity reception. The numbers for the burst errors in 
Table I represent the impairment for a 24-kb/s signal-bit sequence 
(the bit duration determines the number of bits affected by a fade) 
when the mobile radio link is characterized by two-branch diversity 
reception under the following (worse than average) conditions: 


Signal-to-interference ratio = 9 dB 
Signal-to-noise ratio = © 
vehicle speed — V — 29 mi/h 


carrier wavelength — X ~ 0.388 m ~ °0-2 44. (1) 


Table |— Average and median values of D and / [P(E) = 0.03] 


Errors I average D average I median Dyedian 
Bursty 50.0 1.60 2.7 0.8 
Random 32.0 1.03 23.0 0.2 
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2.2 Scrambled errors 


Scrambled errors are of interest in sections of this paper that assume 
the scrambling of signal bits for error protection. The idea of bit 
scrambling is to expose adjacent coder bits to channel conditions that 
tend to be statistically independent. If the scrambling is pseudo- 
random, the receiver can put the received bits in proper sequence by 
an inverse unscrambling operation. To avoid the two operations of 
scrambling and unscrambling, the situation was simulated in our ex- 
periment by scrambling the known bit-error pattern and leaving the 
signal bits in their original sequence. 

Error sequences consisted of binary entries (error bits) # that were 
either 0 or 1, and each entry of 1 represented a bit error in the decoding 
of a corresponding signal bit. The scrambling was accomplished as 
follows. The error-bit sequence # was handled in blocks that were 
bits long, and each bit got a new position, given by a pseudo- 
random number (of bit intervals), as was derived from the current state 
of a maximal-length shift register with log. MW stages.® The value of 17 
was set at 1024, and the effect of scrambling is illustrated in Fig. 4, 
which is a snapshot of part of the 0.025 error-rate data. The three 
sections in the figure represent (contiguous) error sequences that are 
1024 bits long (512 per row, two rows per block). In each of the six 


SAMAR AULD ALCL Uk I 
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1 


+ Seniesa Yar Sam 3 Ve 
A 


513 1024 
M025 EE 
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2049 7560 


2561 3072 


Fig. 4—Illustration of error scrambling [P(Z) = 0.025, V/A = 36.2 Hz]. 
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rows, the lower sequence contains the burst errors (letter B), and the 
upper sequence has the scrambled errors (letter S). It is clear that a 
block length of AJ = 1024 is insufficient for true error randomization. 
Speech recordings have indicated, on the other hand, that values of 7 
as small as 64 are sufficient to achieve useful speech encryption with 
signal-bit scrambling, assuming the 24-kb/s bit rate mentioned earlier. 


2.3 Transmission rate and average error probability P(E) 


Our simulations involved signal transmission rates of 24, 32, 40, and 
48 kb/s. It is reasonable, under assumptions of constant baud rate 
(number of channel symbols/second), to expect higher bit-rate trans- 
missions to be subject to correspondingly higher error rates. For 
example, if 24 and 48 kb/s represent two-phase and four-phase modula- 
tions of channel symbols, respectively, at a fixed 24-kHz symbol rate, 
the average error probability in the 48-kb/s system is expected to be 
typically two times* as large as that in the 24-kb/s scheme.*® In the 
light of this, when we compare similar systems operating at significantly 
different bit rates in this paper (for example, 24 versus 48 kb/s) we 
assume average bit-error probabilities that are appropriately different 
(for example, 0.025 for 24-kb/s transmissions and 0.055 for 48-kb/s 
transmissions). Burst errors and scrambled errors are indicated by the 
notations EB and Es. 


lll. DM-AQF 


Figure 5 illustrates the principles of variable step size delta modula- 
tion with a forward control of step size. The buffer shown in the en- 
coder stores N input samples (typically, in linear pcm format) that 
are used to calculate the best step size A for the (future) delta modula- 
tion of the stored input block. The step size A is recomputed exactly 
once, and explicitly transmitted to the receiver, for every block of N samples. 
The rest of Fig. 5 merely represents a conventional linear delta modula- 
tor-demodulator pair.! The predictor is assumed to be time-invariant, 
and of first order. The equations describing the delta modulations are 
formally summarized below. 


b, = sgn (X, — hi: Z,_1). 
Ly hy: Z,-1 + A: b,. (2) 
Z, = hy: Z,-1 + A-b,. 


I 


The time indices r and r — 1 are not shown in the figure; however, the 


* Strictly speaking, this number is a function of the carrier-to-noise ratio and the 
modem that is employed. For example, the number can exceed two (for a typical 
carrier-to-noise ratio) if Fsk is used as the modulation system instead of psk (for 
transmitting the speech bits over the analog channel). 
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Fig. 5—Block diagram of pM-A@F codec. 


implied one-sample delay occurs in the first-order predictor. Z and Z’ 
are unfiltered staircase functions at the transmitter and receiver. The 
received bit 6’ differs from b if the error bit F is 1, and the step-size 
information A is assumed to be error-protected. For useful delta modu- 
lation, the sampling rate f should be much greater than the Nyquist 
frequency of the band-limited speech. 


3.1 Design of A, N, and h, 


So that the best step-size A may follow the statistics of the input 
speech, the following algorithms were examined. 





N 1 
A= Ki 2, |X; = X,-1| ee (3a) 
Ac= Kal Max |X; Te X,r~1| J. (3b) 
2<r<Nn 
K N ; 1 3 
a= Ke| XO - Xa | (3c) 


Figure 6 plots the signal-to-noise-ratio SNRT at the encoder as a func- 
tion of K, (n = 1, 2, 3) for the above algorithms. The numbers refer 
to 24-kHz delta modulation. Each scheme exhibits an optimal K, that 
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Fig. 6—Step-size computation for DM-AQr. 


obviously represents the best mixture of slope-overload distortion 
(which predominates for K < Kopr) and granular noise (which takes 
over for K > Kopz). It is interesting that the maximum performances 
of the three algorithms are practically the same. This suggests that the 
step sizes resulting from these algorithms may not be significantly 
different, when optimal K values are employed. The rest of the paper 
will assume the use of the ‘‘average absolute slope”’ formula 


4 ol six x 4 
a=y=7| 5! Le --l |: (4) 


Strictly speaking, this formula is optimal only for 24-kHz sampling 
and for perfect integrators (hi = 1). However, corrections for these 
factors were not found to be very significant for the values of f and 
h; used in our study, and formula (4) was therefore uniformly assumed 
for simplicity. (We may mention, however, that step-size dependencies 
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on h, and bit rate will be of interest in the design of DPCM-AQF en- 
coders; this is seen in Section IV.) 

The buffer length was set at N = 256. This represents a compromise 
among three factors: (2) a need to minimize the encoding delay (this 
suggests smaller n values), (zz) a need to keep down the information 
rate in the step-size transmitting channel (this suggests slower up- 
dating, or larger N values), and (222) the need to track the changing 
statistics of speech with an appropriate speed. A buffer length of 5 to 
10 ms turns out to be a good choice for differential coding (this is 
demonstrated quantitatively in the context of ppcmM-a@F);and N = 256 
does indeed correspond approximately to a 10-ms delay for f = 24 kHz 
(and a 5-ms delay for f = 48 kHz). 

The predictor coefficient was set to be hi = 0.9. This was nearly 
optimal from an SNRT viewpoint for the sampling frequencies of interest. 
Over a noisy channel, if one uses SNRR as a performance criterion, 
optimal values of A; tend to be smaller than 0.9. This is because 
“Jeakier’ integrators mitigate error propagation in the output of a 
differential decoder. Once again, in the interest of simplicity, a quanti- 
tative consideration of this phenomenon has been deferred to the case 
of multibit ppcm coding (Section IV). 


3.2 Bit scrambling 


Table II demonstrates how bit scrambling can provide an SNRR 
advantage in the presence of errors. As mentioned earlier, bit scram- 
bling was simulated by using scrambled errors us (in place of burst errors 
EB for an unscrambled bit stream). Informal listening tests indicate 
that the perceptual advantages of bit scrambling in DM-AQF are more 
significant than what the sNRR gains in Table II may suggest. 


3.3 Error protection by redundant coding: EP-DM-AQF 


We studied a redundant pM-a@F coder in which every pair of ad- 
jacent pM bits was protected by the transmission of a (contiguous) 
parity check bit. When the parity failed at the receiver, a possible bit 
error was detected, and the received pm bit pair were forced to form 


an alternating (+ — or — +) sequence. This is equivalent to the 
Table I! — Effect of bit scrambling in DM-AQF [P(EB) = P(ES) 
= 0.055 and entries are SNRR values in dB] 
f(kHz) Speech Burst Errors Scrambled Errors 
32 Male 7.6 8.0 
40 Female 7.8 8.8 
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Table IIl—- Comparison of DM-AQF and EP-DM-AQF 


Scheme (kHz) eevee ag P(ES) sNRR (dB) 
EP-DM-AQF oe 48 0.055 8.0 
SMAGr 39 39 0.055 71 
0.0 


DM-AQF o2 o2 0.025 10. 


imposition of a zero-slope segment in the speech waveform when the 
receiver has no confidence in the incoming bits. Table III compares the 
performance of this error-protected system (EP-DM-AQF) with that of an 
unprotected DM-AQF coder, for the example of scrambled errors. The 
unprotected system has a bit rate of 32 kb/s, while the EP-DM-AQF 
operates at 32 X 3 = 48 kb/s. We are not concerned at this point with 
questions like a specific baud rate. However, in view of transmission 
rate versus error probability relations over real channels (Section II), 
the interesting comparison in Table III is between rows 1 and 8 
(rather than between 1 and 2). It appears that the simple parity- 
check-based error protection is not being useful; the advantages due 
to error detection at the receiver are being offset (or more than offset) 
by the increased error probability characteristic of the higher trans- 
mission rate in EP-DM-AQF. A similar result has been obtained in a 
simulation of DM-A@F with correlated errors, and also with ppcm en- 
coders where only the most significant bit is error-protected by the 
use of redundancy. 


3.4 Unprotected DM-AQF with bit scrambling; choice of f 


We have considered in some detail the specific case of unprotected 
(nonredundant) pM-A@F with bit scrambling. Table IV presents sNRT 
and SNRR values for such a system at different values of f and matched 
values of error probability, P(#S). Some entries in Table IV are inter- 
polated values because error sequences with the corresponding P(HS) 
values were not available. As suggested earlier in the example of binary 
versus quaternary PSK, an obviously meaningful comparison is between 
rows 1 and 4 whose error ratios differ by a factor of two. 


Table IV—DM-AQF; Effect of f 


f (kHz) P (ES) snrT (dB) SNRR (dB) 
24 0.023 17.1 8.6 
32 0.032 212 9.6 
40 0.040 23.7 10.5 
48 0.048 26.0 11.2 
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At the transmitter end, the quantization noise is easily perceived at 
f = 24 kHz. It is barely apparent at f = 32 kHz, and a choice of 
f = 40 kHz is likely to be more than adequate for many situations. 
Notice that, as f increases, so does the difference between SNRT and 
SNRR; and the quantization noise has a lesser and lesser influence on 
the speech quality at the receiver because of the relatively greater 
contributions of channel noise. 


3.5 A comparison with syllabic-companded DM-AQB 


To demonstrate that forward step-size coding is indeed desirable for 
the mobile radio channel, the pM-aAqr scheme was compared with a 
syllabic-companded delta modulator with backward-step-size control 
(aQB). The step-size algorithm for the DM-aQB was 


A, = 0.966: A,1 + 25-[apaprT ], 
3 

dL, Or-s—p 
s=0 

= 0 otherwise. (5) 


1 if = 4, for p = Oorl or 2 


[ADAPT ], 








The algorithm is reminiscent of, if not identical to, the digitally con- 
trolled delta modulation (pcpM) scheme due to Greefkes,’ which 1s an 
AQB technique well known for its error resistance. Figure 7 demonstrates 
that, in the presence of bit errors, the performance of pM-aQ@F degrades 
more gracefully than that of the pM-a@p defined in (5). It must be 
remembered, of course, that the DM-AQ@B system is implemented more 
easily and without encoding delay.*® 


3.6 The problem of step-size transmission in AQF 


We have tacitly assumed so far that step-size information In DM-AQF 
can be very reliably transmitted, even over a fading channel, because 
step-size updating has to be done only infrequently. We shall now 
demonstrate this with some numbers. 

igure 8 illustrates a histogram of step sizes that resulted from 
utilizing (4) for a 32-kHz pM-aar encoder. It was noted that the en- 
coding was very tolerant to a maximum step-size constraint of 155, 
and a step-size resolution equal to 10; in other words, to a step-size 
dictionary of only 16 steps (5, 15, ---, 155). In practice, the maximum- 
to-minimum step size ratio would probably be greater than 31, in 
anticipation of highly nonstationary speech inputs. 

The four-bit step-size information was transmitted as follows. At 
the beginning of each block of N = 256 bits, the respective four-bit 
word was transmitted five consecutive times. Each bit in the step-size 
word was decoded on the basis of a majority count over the five 
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Fig. 7—Comparison of DM-AQF and DM-AQB. 


recelved versions of the bit. The step-size transmissions increased the 
overall bit rate from 32 kb/s to 32[ (256 + 4 X 5)/256] = 34.5 kb/s. 
For a random error rate of P(ZS) = 0.025, the snrr with the explicit 
transmission of step size, as above, was nearly identical with the value 
obtained in a simulation that tacitly assumed the presence of correct 
step-size information at the receiver. The result is not surprising; the 
probability of failure of a majority count of order 5 is given by 


P(M.C; 5) = y ead @ 


r 
~10p? if p<1, (6) 


where p is the error probability. With p = P(E#S) = 0.025, P(M.C; 5) 
= 1.64 X 10+. The probability that at least one of the bits of a step- 
size word is wrongly decoded in our scheme is therefore no greater than 
6.4 in 10,000, and there were only 250 step-size transmissions during 
the entire length of the (2-s) speech utterance being coded. 


3.7 SNRT, SNRR, and P(E) as functions of time 


We conclude our discussion of DM-AQF with an interesting demonstra- 
tion of the time dependencies of SNRT, SNRR, and P(ES), as measured 
over blocks that were N = 256 samples long. The sampling rate was 
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SAMPLING RATE f (KHz) : 24 
PEAK SPEECH AMPLITUDE : 2048 


PROBABILITY 





STEP SIZE A 
Fig. 8—Histogram of step sizes in pM-agr (peak speech amplitude = 2048). 


24 kHz, the average error probability was 0.025 [refer to Fig. 2 and 
eq. (1) for error characterization, ], and the plots on Fig. 9 used numbers 
taken once every 20 blocks (5120 samples). Notice the obvious negative 
correlation between the time functions sNRR[¢t] and P(E#S)[?t]. The 
time variation of SNRT is, of course, purely a reflection of the input 
speech material. 


IV. DPCM-AQF 


Figure 10 is a block diagram of differential pcm with forward step- 
size control. Differences from Fig. 5 consist in the use of a B-bit quan- 
tizer (B = 3 or 4 in this paper), and in the assumption of Nyquist-rate 
sampling, which obviates the need for a critical output filter. Basic 
DPCM notation is as follows: W is the normalized code word magnitude, 
e is the prediction error, and @ is the quantized value of e. The time- 
invariant (first-order) predictor coefficient is hi, and r represents an 
instantaneous (sampled) value. The received bits b, (¢ = 1, 2, ---, B) 
are different from the transmitted bits b, if a corresponding error bit 
E equals 1. The step size is A; it is assumed to be recalculated once 
every NV samples, and successfully error-protected in transmission. The 
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Fig. 9—Time variation of SNRT, SNRR, and P(E) (V/A = 36.2 Hz). 
following are the salient ppcm equations. 
by = +1; gq=1,2,---, B. 
bbgr = 0.564, + 0.5 = 0 or 1. 
Cy = XxX, o hy: ya: 
Ve = hi- Y,-1 + er. (7) 
Y, = hy: Yo 5 é,. 
é, = W,-A. 
B 
W, = > 2°-4- bby | sgn Diy. 
qua 


I’or any sample 7, the sign of the code word W is the most significant 
bit bi; the least significant bit is bz. 
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Fig. 10—Block diagram of ppcM-aQF codec. 
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4.1 Design of A, N, and h, 


AQF step sizes are derived (once for every block of N samples), using 
the formula 

1 
N-1 


Figures 11, 12, and 13 illustrate typical SnrT and SNRR dependencies on 
the parameters K,, N, and hi, respectively. The curves refer to the 
case of B = 4, P(EB) = 0.055, and to a redundant transmission tech- 
nique described in Fig. 14. It is clear that SNRT-maximizing designs are 
significantly different from the SNRR-maximizing values. Rather than 
getting bogged down in the controversial question of whether SNRT 
or SNRR is to be used as a performance criterion, we have elected, 
arbitrarily, to discuss the following SNRR-maximizing designs that were 
approximately good for the P(/) range of 0.025 to 0.055: 


N = 64 

hy = 0.6 

K,=0.50 if B=83 
= 0.25 if B=4. 


A = K,- 





N 
2 |X, — hi: X71]. (8) 


(9) 


Notice that, in Fig. 12, snrR is maximum at N = 128. However, the 
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Fig. 11—Step-size computation for ppcM-AQF. 


encoding delay is less objectionable (8 ms, instead of 16 ms) with 
N = 64. Note also that SNRT-maximizing designs call for higher values 
of both hi and K4. 

The maximum-to-minimum step-size ratio in the simulation was 
about 1000. It is possible to reduce this ratio to 100, and still provide 
useful coding of nonstationary speech.! Smaller step-size ratios enhance 
bit-error resistance. They also tend to simplify the problem of trans- 
mitting step-size information. 


1574 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 


B=4 
P{ES) = 0.055 


SNR IN DECIBELS 





32 128 256 512 1024 
N 


Fig. 12—Step-size updating in DPCM-AQF. 


4.2 Error-protected DPCM-AQF 


Figure 14 illustrates the use of time-diversity coding designed to 
protect ppcm bits from burst errors. The time-diversity is provided by 
the delay P that will be discussed presently. Figure 14a defines a 
three-bit EP-pPpcm system where the most significant bit b; is trans- 
mitted three times, and the second most significant bit be is sent twice. 
The least significant bit b3 is transmitted only once. At the receiving 
end, the value of b; is determined on the basis of a majority count over 
the three received versions. In regard to the magnitude bit bs, if the two 
versions of 62 do not agree, the receiver code word is forced to its 
smallest magnitude (the polarity is still defined by the unequivocally 
decoded value of 6;). This is equivalent to forcing a ‘“minimal-slope”’ 
segment in the decoded speech waveform when the receiver is in doubt 
about the code-word magnitude. Figure 14b defines a four-bit Ep-ppcmM 
system where only the most significant bit b; is error-protected. Once 
again, the decoding of 6; at the receiver follows a majority count over 
the three received versions thereof. Assuming 8-kHz sampling, both 
EP-DPcM systems of Fig. 14 would operate at 48 kb/s. However, the 
three-bit system of Fig. 14a has the benefit of greater error protection. 
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Fig. 13—Design of predictor coefficient h; in DPCM-AQF. 


Figure 15 shows the benefits of time diversity for the example of 
the four-bit system of Fig. 14b. It is interesting that snRR is still tend- 
ing to increase at P values as large as 1024. It can be expected that, if 
P>[D+ I Javerage) SUCCeSSive repetitions of a given bit tend to be 
affected independently by the channel. D and J are the burst duration 
and spacing mentioned in Section IJ. The ppcm-aar coders of this 
paper assume a uniform value of P = 768. For a bit rate of 48 kb/s, 
this implies a total encoding delay (from Fig. 138) of 2 P bits, or about 
o2 MS. 
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Fig. 14—Time-diversity coding for EP-DPCM. 


4.3 EP-DPCM; choice of B 


We now compare the two 48-kb/s systems of Fig. 14. Table V shows, 
for two different speech inputs, the SNRR values obtained with the 
three-bit and four-bit systems. The greater error protection in the 
three-bit system seems to make it more robust, in spite of the better 
quantization noise (SNRT) performance of a four-bit coder, and the 
better receiver-end quality of three-bit coded speech 1s very obvious 
in listening tests. The result is also mentioned by Noll.’ It is true that 
four-bit coding can provide a 6-dB superiority in snrt. It appears, on 
the other hand, that the subjective SNRT in DpcM is known to be con- 
siderably higher than a measured objective sNRT,! and the SNRT of 
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Fig. 15—Effect of time diversity on received-speech quality (V/A = 36.2 Hz). 
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Table V— EP-DPCM; Comparison of three- and four-bit systems 
(entries are SNRR values in dB; rows 1 and 2 
represent different speech inputs) 


P(EB) B=3 B=4 
0.025 14.2 12.8 
0.055 9.7 9.3 


three-bit DpcM may prove to be subjectively adequate for some 
mobile links. In that case, the system of Fig. 14a would be a good 
configuration for error-protected DPCM. 

Table VI further demonstrates the benefits of error protection for 
B = 3. In view of the transmission rate—P(EB) relationships men- 
tioned in Section II, the interesting comparison in the table is between 
rows 1 and 3, not 1 and 2. It is seen that EP-ppcm at 48-kb/s provides 
a better SNRR than unprotected 24-kb/s ppcM, in spite of the higher 
error probabilities that accompany the 48-kb/s transmissions. This 
contrasts interestingly with the results of Table II where error protec- 
tion was seen to be ineffective for pM-a@Fr. The suitability of error 
protection for ppcm (and not pM) seems to be a direct consequence of 
the multibit quantization in ppcM: it is possible to isolate and error- 
protect only the more significant ppcm bits and incur an overall re- 
dundancy of 50 to 100 percent; a majority count for a 24-kHz pm 
would immediately result in a transmission rate of 72 kb/s (and a 
redundancy of 200 percent). 

In a recently proposed, and not less effective, approach to EP-DPCM 
coding,® the prcm bits are error-protected in suitably long blocks 
rather than on a bit-by-bit basis: The time diversity reception consists 
in selecting one of two time-separated blocks on the basis of an auto- 
correlation-type quality evaluation at the receiver. 


4.4 Bit scrambling in DPCM 


Informal listening tests, as well as SNRR evaluations, have shown 
that bit scrambling, and the resulting error-randomization, is much 
less effective for multibit ppcm than for pm. The reason for this is not 


Table VI — Benefits of error protection for DPCM (B=3) 


Transmission 
Code Rate (kb/s) P(EB) SNRT SNRR 
EP-DPCM 48 0.055 19.4 12.4 
Unprotected prpcm 24 0.055 19.4 7.1 
Unprotected ppcM 24 0.025 19.4 9.6 
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well understood. However, situations exist where bit scrambling can 
provide nominal SNRR gains even for ppcm. These have been noted by 
Noll.? 


4.5 DPCM-AQB 


The problem of step-size transmission for DPCM-AQF is expected to 
be handled through techniques not very different from those discussed 
in the context of pM-agF. Following the calculation procedures of that 
section, it is estimated that virtually error-free transmission of DPCM 
step size would be possible (for the error rates considered) by the 
expenditure of about 5 kb/s of channel capacity.* To indicate the 
desirability of dedicating this kind of channel capacity for step size, 
we investigated two types of backward step-size control. One of these 
was adaptive differential quantization with a one-word memory.! Here, 
the quantized step size is modified for every sample by a factor de- 
termined solely by the magnitude of the latest code word W,. The other 
adaptive scheme derived step-size information by an algorithm similar 
to the ppcm-aQF rule (8). The summation, however, was over the most 
recent N samples of quantized speech. Neither of the above backward 
schemes performed well enough with bit errors to merit inclusion of 
their results. It is conceivable, however, that, as in DM, some kind of a 
slowly adapting or syllabic ppcM may provide a fair result for mobile 
radio. It is also conjectured that the performance of such a scheme 
would be upper-bounded by that of ppcM-aqF in the manner of Fig. 7. 
At least one approach to slowly companded prcm has been proposed 
to date.1°1! 


V. CONCLUSION 


The object of this paper was to specify two differential coders—one 
from the pm family and the other from the ppcm class—that would be 
appropriate for digitizing speech in some types of mobile radio sys- 
tems. The results of our work indeed suggest two such coders: a non- 
redundant 40-kHz pmM-agFr coder with bit scrambling and an error- 
protected three-bit DpcM-aQF operating at a nominal 48 kb/s. The 
typical capabilities of these systems are summarized in Table VII, 
which is based on the example of a female utterance, ‘The lathe is a big 
tool.”’ The transmission rates and error probabilities in Table VII are 
matched, albeit in a limited sense, as discussed earlier. Also, as em- 
phasized already, the error rates in Table VII are worse-than-average 
numbers for many mobile radio links. 


*Tf the overall transmission rate of the system is constrained to be 48 kb/s, it may 
be possible to work with a sampling rate of about 7 kHz, instead of 8 kHz, to accom- 
modate the step-size information in the 48-kb/s channel (Ref. 4). 
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Table VIl— Comparison of DM-AQF and EP-DPCM-AQF 


as Esti- 
Coder B | f(kHz) | ™Ss!00 | mate of Bit-Error SNRT | SNRR 


rate 5(kb/s) Probability (dB) | (dB) 
EP-DPCM-AQF | 3 8 48 + 6 5 P(EB) = 0.055 | 20.5 | 14.5 
DM-AQF 1 40 40+ 6 <5 P(ES) = 0.045 | 23.7 | 10.1 


In assessing the coders of Table VII, it may be worth noting that the 
DM system is more flexible. For example, the pM sampling rate can be 
lowered to 32 kHz with only a 2.5-dB loss in maximum speech quality 
SNRT (Table IV). Further, if the refinements of time-diversity coding 
(in ppcm) and bit scrambling (in pm) are eschewed, it is our experience 
that the pm system will lose less in the process. 

Obviously, 2 common denominator in the above systems is adaptive 
differential quantization. Crudely speaking, adaptive quantization 
serves to squelch channel noise, while differential coding tends to 
smear it; and the combination appears to be perceptually very desirable 
in the context of mobile telephony. 

Formal perceptual studies in this subject should appropriately in- 
clude other digital techniques such as nondifferential (ecm) and back- 
ward-adaptive (AQB) coders. The studies should also include the pos- 
sible effects of encoding delay. Clearly, the amount of this delay de- 
pends on what combination of refinements (forward coding, bit 
scrambling, and time diversity) is employed; and if the total delay gets 
to be long enough, the benefits of a better SNRR (due to reliable step- 
size information, error randomization, and redundant error protection, 
respectively) may be accompanied by a loss of echo performance over 
certain kinds of networks. The best compromise between transmitted 
speech quality, received speech quality, and encoding delay is very 
likely to be system-specific; and the nature of this compromise may 
influence or define a selection among analog techniques, conventional 
digital schemes (aqs), and step-size transmitting codes (AQF). 
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An Autocorrelation Criterion for the Time- 
Diversity Reception of Speech Over 
Burst-Error Channels 
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This paper proposes a new approach to signal selection in time-diversity 
systems. Specifically, we consider the problem of digital speech transmission 
over a burst-error channel using two-channel time-diversity reception. 

Let every speech segment (of length W) be transmitted twice so that at 
least one of the transmissions escapes an error burst, with a certain useful 
probability. Let the received speech segments be Y; and Yo. We propose 
an autocorrelation-maximizing signal selection procedure of the following 
form. 

Select Y; (or Y2) as the “‘cleaner’’ speech segment according as C(Y1, W) 
= (or <) C(Yo2, W), where 


Ww 
C(Y., W) = Dd (sgn Yur-sgn Yug-y)/(W — 1); u = 1, 2. 
r=2 


Yur ws the speech amplitude at sample r in Y,, W is a computational 
window that ts typically a few milliseconds long, and sgn Yu, is a polarity 
function that 1s assumed to have zero mean and unit variance. 

The use of sgn Yu, instead of Yur leads to a simply implemented selec- 
tion procedure, and computer simulations have demonstrated its practical 
utility. For example, in one study of three-bit DPCM coding, autocorrela- 
tion-based burst-error detection proved to be more useful than a procedure 
where DPCM samples were error-protected on a bit-by-bit basis, rather than 
in blocks. 


I. THE BURST-ERROR PROBLEM 


The research reported in this paper was motivated by the problem 
of digital speech communication over a mobile radio channel. Signal 
transmissions over such a channel are characterized by multipath 
fading. The fading is ‘‘slow’ in the sense that a given fade (signal 
strength below a specified threshold) can last for several tens of milli- 
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seconds (which will typically involve several tens or several hundred 
speech bits). The end effect of these “‘slow’ fades on digital transmis- 
sions is to introduce bursts of errors in the reception of speech- 
carrying bits. 

The time statistics of these error bursts are illustrated by the 
distribution functions in Fig. 1. D is the error-burst duration and I 
the error-free interval between successive bursts. An error burst is de- 
fined to have a local error probability of 1. In other words, a burst of 
length Do implies that Do contiguous speech bits are in error. An iso- 
lated error, for example, is an error burst of length Dp = 1. The curves 
refer to a subsegment from a bit-error sequence whose average error 
probability was 0.06. Note that the local error probability in Fig. 1 
[the ratio of Davyerage tO (Daverage + average) | 18 0.048. Notice also that 
Taverage >> Imedian, SUggesting a long tail in the interval distribution. 

The error sequence was obtained from a fading simulator,’ and it 
represents the impairment for a 24-kb/s signal-bit stream (the bit 
duration determines the number of bits affected by a fade) when the 
mobile radio link is characterized by two-branch diversity reception 
under the following (worse than average) conditions: 


6 dB 


oe) 


I 


Signal-to-interference ratio 


I 


Signal-to-noise ratio (1) 
vehicle speed = V = 29mi/h 


radio wavelength A 0.353m _ SO 


A companion paper’? provides a somewhat more elaborate discussion 
of signal fades and error bursts. 


Il. TIME-DIVERSITY CODING 


The temporal structure of clustered errors can be exploited in re- 
dundant transmission schemes where message units are repeated with 
an appropriate time spacing. The optimum time spacing is, in general, 
a function of the error statistics. For example, the spacing can be 
designed to minimize the probability that both of two consecutive 
transmissions of a given message unit are affected by an error burst or 
bursts. The message unit can be a block of speech-amplitude samples, 
or a single bit from a digital speech code, and so on. 

A recent proposal discusses the use of time diversity for three-bit 
pDPcM transmissions over mobile radio.? Briefly, redundancy is intro- 
duced in the form of three transmissions of the most significant (sign) 
bit B; in a ppcm word and two transmissions of the second most signifi- 
cant (magnitude) bit B.. The average redundancy is therefore 100 
percent. The receiver decodes the sign bit B,; on the basis of a majority 
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Fig. 1—Time statistics of burst errors [P(#) = 0.06]. 


count (over the three received versions). It also looks for unanimity 
between the two received versions of the significant magnitude bit Bz. 
If the unanimity does not exist, the ppcm word is forced to its minimum 
possible magnitude. When the spacings between the repetitions of 
pPom bits are properly designed, the technique provides a significant 
advantage over nonredundant pecm.? We comment again on this pro- 
cedure at the conclusion of Section IV. 

The purpose of this paper is to propose a different approach to time 
diversity. The method is based on error-burst detection (rather than 
single-error correction, as in a successful majority count); and the 
message unit that is error-protected is a block of contiguous speech 
amplitudes, rather than a basic speech-carrying ppcm bit or word. The 
idea of protecting message blocks using time diversity is not, in itself, 
claimed to be novel. What is interesting in our technique, however, 1s 
the method by which a high bit-error density is detected in a received 
speech segment (more strictly, in one of two segments in a diversity 
pair). The basis of such burst detection is a simple autocorrelation- 
type measurement of relative speech (or channel) quality, denoted by 
C’. Unlike a signal-to-noise ratio (SNR), the quantity C can be evaluated 
over a received segment without reference to the transmitted speech. 
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In fact, our channel evaluations, based on C, are somewhat remi- 
niscent of eye-pattern-based channel assessments in digital data 
communications. 


Il. AUTOCORRELATION C 


The proposed measurement is the correlation 
Ww 
C(X, W) = D, (sgn X,-sgn X,1)/(W — 1), (2) 


where X, represents a sampled speech amplitude, W is a computational 
window that is typically a few milliseconds long, and sgn X is a polarity 
function whose mean value and variance are assumed to be 0 and 1. 
We will also be interested in the correlations C(XQ, W) and C(Y, W), 
where the quantities XQ and Y refer to (unfiltered) staircase functions 
at the outputs of local and remote speech decoders (Fig. 2). C(XQ, W) 
and C(Y, W) are defined by operations similar to (2). 

In simulating digital transmissions of speech over burst-error 
channels, we have found that clustered transmission errors tend to 
have the following type of effect on C’: with a high probability (say, on 
the order of 0.9 or more), C(Y, W) < C'(X, W), where X and Y repre- 
sent original and received speech segments. Qualitatively, the result 
is a consequence of increased zero-crossing activity in error-corrupted 
speech waveforms. Actual values of C(Y, W) depend not only on the 
local error statistics, but also on the value of the corresponding 
C(X, W), the nature of the quantization of X (prior to transmission) 
as reflected in the value of C(XQ, W), and the extent of channel error 
propagation in the received signal (Gif the quantization is differential). 
Because of these factors, the magnitude of C(Y, W) cannot be used, 
as such, for very reliable burst-error detection. 


TRANSMITTED RECEIVED 
INPUT SPEECH SPEECH 


BITS BITS 
SPEECH scadoes BURST—-ERROR 
CHANNEL 


LOCAL 
QUANTIZED DECODER 


SPEECH 


DECODER 
RECEIVED 


SPEECH 





Fig. 2—Definition of X, XQ, and Y. 
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Fig. 3—Distributions of C(X, W), C(XQ, W), and C(Y, W) in 24-kHz delta 
moditlation [P(E) = 0.06, W = 64]. 


These points are demonstrated by the results in Fig. 3 and Table I. 
These results refer to the 24-kHz delta modulation? of the band-limited 
(200 to 3200 Hz) female speech utterance, ‘‘A lathe is a big tool.’’ The 
delta-modulation bits were transmitted through a simulated burst- 
error channel whose time statistics are shown in Fig. 1. As mentioned 
earlier, the average bit-error probability P(E) on this channel is 0.06. 
Local error probabilities, as measured over blocks of W samples, will 


Table |— Mean and median values of C(X, W) — C(Y, W) 


W = 64 Median Mean 
P(E, W) = 0.11 0.14 
P(E, uae = 0.18 0.26 
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be denoted by P(E, W). (In delta modulation, a “sample” is synony- 
mous with a “bit.” In B-bit pcm or differential pcm, a ‘‘sample’’ refers 
to an entire B-bit word.) The windows for Fig. 3 and Table I are 
W = 64 samples long. 

Figure 3 shows the distributions of C(X, 64), C(XQ, 64), and 
C(Y, 64): specifically, values of the probability that CCU, W) is less 
than A where U = X, XQ, or Y; W = 64; and —1 S A <1. The 
results refer to a subset of samples characterized by nonzero values of 
P(E, W), and an average error probability of 0.14. Notice how quanti- 
zation errors, as well as transmission errors, tend to decrease the cor- 
relation C. Correlation losses due to noise and distortion are also 
demonstrated in Table I, which summarizes mean and median values 
of [C(X, 64) — C(Y, 64) ] for two channel conditions: the case of zero 
transmission errors [a subset of blocks where P(E, W) = 0] and the 
case of nonzero transmission errors [the subset of blocks where the 
average P(E, W) = 0.14]. Incidentally, both these subsets belong to 
the set of blocks whose average P(E, W) = 0.06. The top row in 
Table I measures the effect of quantization errors (plus, strictly 
speaking, the effect of error propagations in received speech), while 
the bottom row demonstrates the contributions of local transmission 
errors. 

The distribution distances in Fig. 38 and the numbers in Table I 
both lead to the following conclusion: Although the channel quality 
[P(#, W)] has a very clear effect on the autocorrelation C, the effect 
is not strong enough for C(Y, W) to be employed, as such, as a reliable 
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measure of speech or channel quality [P(£) ]. To explain, a low value 
of C(Y, W) is often indicative of a local transmission error burst. Oc- 
casionally, however, a poor autocorrelation may simply be a reflection 
of received waveform history and/or quantization noise, and/or an 
above-average high-frequency content in the local speech input. 

A situation where channel information can be reliably extracted 
from C is in time-diversity coding. Consider, for example, the two 
speech segments Y; and Y. of a time-diversity pair (Fig. 4). The 
channel-independent factors mentioned at the end of the previous 
paragraph are exactly the same for both Y; and Y». Consequently, any 
difference between C(Y1, W) and C(Y2, W) can be safely attributed to 
differences in the channel conditions affecting the receptions Y; and Y»2. 


IV. THE USE OF C IN TIME-DIVERSITY CODING 


We propose that, for time-diversity reception, the autocorrelation 
C be used as a criterion for speech segment selection at the receiver. 
For example, with two-channel time diversity (Fig. 4), we suggest the 
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Fig. 5—Performance of C(Y, W)-based speech selector with three-bit DpcM 
[W = P = 64; P(E) = 0.06]. 
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following reception rule: 
Y = Y;, (or Y2) according as C(Y1, W) = (or <)C(Y2, W), (3) 


where 


W 
C(Y., W) = dX (sgn Yur-sgn Yuer—ay)/(W —- 1); u=1,2. 
r=2 


The effect of (8) is to select the speech segment whose signum 
(polarity) function exhibits the higher autocorrelation. The rest of this 
section presents results that demonstrate the credibility of the above 
procedure. Specifically, we point out that very strong negative corre- 
latzons exist between the following quantities: 


sgn LC(Y1, W) = C(Y2, W) 
and (4) 
sgn [P(Ai, W) — P(he, W)]. 


It is assumed that smaller P(£) values imply better speech quality so 





Fig. 6—Performance of C'(Y, W)-based speech selector with three-bit pcm [W = P 
= 64; P(£) = 0.06]. 
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that negative correlations between the quantities in (4) are indeed 
indicative of an appropriate reception rule. Most of the following dis- 
cussion refers to three-bit ppcm coding. This is an example of practical 
interest for the time-diversity coding of speech over burst-error 
channels.? 

Figures 5, 6, and 7 are scatter plots of [C(Y1, W) — C(Y2, W)] 
versus [P(£i1, W) — P(E., W)] for illustrative speech codes (Pcm, 
DPcM) and average transmission-error rates of 0.03 and 0.06. The 
speech input was the same as that used in Section III, and the scatter 
plots represent sample subsets of simulation results. The members of 
the subsets were equally spaced points that spanned the total speech 
duration of about 1.5 seconds. Notice the negative correlation between 
[C(V1, W) — C(Y2, W)] and [P(Ai, W) — P(E2, W)] in each of 
Figs. 5, 6, and 7. This negative correlation reflects the fact that (for a 
given speech input and quantization error pattern) a higher C'(Y, W) 
value implies a lower P(E, W) value, i1.e., a better speech quality. The 
very small I- and III-quadrant occupancies reflect a low probability 





Fig. 7—-Performance of C(Y, W)-based speech selector with three-bit ppcm 
[W = P = 64; P(E) = 0.03]. 
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of failure (wrong speech-segment selection for the C(Y, W)-based 
selector (3)). 

We briefly discuss the effects of correlation window length W and 
time-diversity spacing P on received speech quality. The quantities 
SNRT and SNRR refer to signal-to-noise ratios measured over the dura- 
tion of the entire speech utterance: 


SNRT = )) X7/) (Xr — XQ,) 5 
srk = 5 X2/¥ (X, ~ Y,)* ©) 


T and R refer to SNR values as measured at the local (transmitter-end) 
and remote (receiver-end) decoders (Fig. 2). We are interested in 
ppcm codes with a forward-adaptive quantizer: the step size is up- 
dated every 64 samples at the transmitter, and the step-size informa-~ 
tion communicated to the receiver in a special error-protected format.’ 
Finally, the differential coding uses a time-invariant first-order pre- 
dictor. The predictor coefficient was 0.6. This value was suggested by 
the need to dissipate the effects of channel errors in the reconstructed 
speech, as explained in the companion paper.’ 

Table II shows the effects of W and P on the received speech quality 
as measured by snrpr. It is seen that 100-percent redundancy, together 
with a good choice of W and P, can buy a more-than-4-dB improvement 
over unprotected ppcm. Incidentally, the overall transmission rate is 
approximately 48 kb/s for the time-diversity codes and 24 kb/s for 
the nonredundant code. The lower error rate (0.03) used for the latter 
is a reflection of the lower transmission rate.!.” 

Figure 8 elaborates on the performance of the optimal (W = 64, 
P = 256) time-diversity code, while Table IJI compares its perform- 
ance with that of the bit-protecting scheme? mentioned in Section II. 

The diversity systems are formally sketched in Fig. 9. The encoding 
delays (P + W for block protection and 2P’ for bit protection) are 


Table Il — Effect of W and P on SNRR [83-bit DPCM; P(E) = 0.06] 


W P SNRT SNRR 
(Number of 8 kHz-samples) (dB) 

64 64 20.4 14.5 
128 128 20.4 12.1 
256 256 20.4 13.3 

64 256 20.4 15.8 

Unprotected 3-bit ppcm with 20.4 11.5 
P(E) = 0.03 
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Fig. 8—Performance of C(Y, W)-based speech selector with three-bit ppcm 
[W = 64; P = 256; P(E) = 0.06]. 


chosen to be of the same order of magnitude. (Both the schemes are 
expected to perform slightly better with longer encoding delays.) 
Table III indicates a slight SNRR superiority for the block-protection 
technique, especially at the higher error rate. What is more significant 
than the SNRR advantage is a perceptual effect; the block-protected 
speech sounds considerably crisper. The companion paper? includes 


Table II[— SNRR values (dB) in block-protecting and 
bit-protecting schemes for time-diversity coding 
of three-bit DPCM speech 


P(E) Bit Protection Block Protection 
0.000 20.4 20.4 
0.024 17.0 17.4 
0.054 14.5 15.8 
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Fig. 9--Time-diversity coding based on (a) block protection (W = 8 ms, P = 32 
ms); (b) bit protection (2P’ = 32 ms). 


more observations on the speech quality resulting from error-protected 
DPCM. 


V. CONCLUSION 


This paper has demonstrated the capabilities of a new technique for 
signal selection in time-diversity systems. The results of Table III are 
a good indication of the practical utility of the new technique. We 
believe, however, that the contribution of this paper consists not in the 
specific quality improvements (over bit-protecting systems) in Table 
III, but in the fact that the autocorrelation of the most significant 
bit (polarity function) is indeed a useful measure of relative signal 
quality over noisy channels. This is demonstrated mainly in the scatter 
plots in Figs. 5, 6, 7, and 8. The use of the most significant bit in 
evaluating signal quality leads obviously to simple implementations. 
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Fig. 10—Implementation of an autocorrelation-based block selector. 


1594 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 


A possible configuration for an autocorrelation-maximizing signal 
selector is depicted in Fig. 10. 
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A Comparative Study of Various Quantization 
Schemes for Speech Encoding 


By P. NOLL 
(Manuscript received February 7, 1975) 


In this paper, the performance limits, as given by the signal-to-noise 
ratio (s/n), are described for different speech-encoding schemes including 
adaptive quantization and (linear) adaptive prediction schemes. The 
comparison 1s made on the basis of computer simulations using 8-kHe- 
sampled speech signals of one speaker. Different bit rates (two bits per 
sample—five bits per sample) have been used. 

A three-bit-per-sample PCM scheme with a nonadaptive u100 quantizer 
leads to an s/n value of approximately 9 dB. A maximum s/n value of 
approximately 25 dB has been reached using an encoding scheme tin- 
cluding both adaptwe quantization and adaptive prediction. HKntropy 
coding of the quantizer output symbols leads to an additional gain in 
s/n of nearly 3 dB. 


l. INTRODUCTION 


Design of an efficient encoding scheme requires some knowledge of 
the statistics of the signal. Efforts to improve the performance of pcm 
systems have taken two primary directions: 


(2) Use of quantizing schemes based on knowledge of the (one- 
dimensional) probability density function (epF) of the samples 
to be quantized. 

(77) Use of quantizing schemes exploiting the correlation between 
successive samples. 


If we had an a priort knowledge of the statistics of the samples, a 
nearly optimum quantization scheme could be used consisting of: 


(2) A quantizer matched to the pprF of the signal to be quantized. 
(21) A predictor optimized for the given autocorrelation function of 
the signal. 


The predictor lowers the variance of the signal to be quantized by 
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removing the correlation between successive samples. This is done by 
subtracting an estimation value from each incoming sample; the differ- 
ence can be quantized, encoded, and transmitted (differential pcm 
= DPCM). 

In digital speech-encoding systems, we have only a small amount 
of a priort knowledge of the statistics which, in addition, usually 
change with time: 


(2) The long-period mean level differs from speaker to speaker. 
(22) At a given mean level, the instantaneous level changes be- 
cause of variations in speech sounds. 
(212) The correlations between successive samples change because of 
variations In speech sounds. 


To overcome these problems of unknown statistics, adaptive quan- 
tization and adaptive prediction schemes must be used. In these 
schemes, local estimates of the statistical parameters are calculated. 
The quantizer and/or predictor are then optimized based on these 
estimates.! 3 

This paper compares different encoding schemes that include: 


(2) Fixed quantizers. 
(72) Adaptive quantizers. 
(211) Fixed predictors. 
(2v) Adaptive predictors. 


The comparison is done on the basis of computer simulations; the 
signal-to-quantization noise ratio (s/n) has been used as the criterion 
for the comparisons. It is believed, however, that the s/n understates 
the subjectively perceived performance of encoders that have differ- 
ential quantizers (the ppcm schemes in this paper).! 


I. DESCRIPTION OF THE ENCODING SCHEMES 


A computer program has been written that allows the simulation of 
encoding schemes combining the possibilities of nonadaptive or 
adaptive quantization and nonadaptive or adaptive prediction. The 
schemes that have been used are described in the following sections. 


2.1 Fixed and adaptive quantizers 


If the quantizer is nonadaptive, its characteristic is assumed to be 
logarithmic. Optimum, i.e., s/n-maximizing quantizers (whether uni- 
form or nonuniform), cannot be used, not even under the assumption 
of a constant mean level, because the idle channel noise is higher for op- 
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timum quantizers than for logarithmic quantizers and results in poorer 
subjective performance.*.* The idle channel noise performance is de- 
termined by the smallest reconstruction level 7; of the quantizer. 
Table I lists these values for various optimum three-bit quantizers (the 
term Gauss quantizer refers to a quantizer with an s/n-maximizing 
performance for signals with a gaussian ppF, etc.). The gamma PpF 
is a good model for speech amplitudes, but the smallest reconstruction 
level is 2.4 times higher for the corresponding optimum quantizer than 
for the logarithmic quantizer. 

To overcome the problems of unknown mean level and the variations 
of the instantaneous level, adaptive quantization schemes (AQ schemes) 
can be used. A local estimate o? of the variance of the input signal can 
be calculated; this value controls the gain of an amplifier located in 
front of a quantizer that is optimum for signals with unit variance. Two 
schemes are possible: 


(2) Forward estimation (AQF): The estimation value is calculated 
from samples of the input signal. The input signal must be 
buffered, and the estimation value must be transmitted to the 
receiver in addition to the quantized samples.* ® 

(12) Backward estimation (Ags): The estimation value is calculated 
from quantized samples*~°; therefore, the state of the amplifier 
need not be transmitted (except for synchronizing purposes in 
case of channel errors). 


Figure 1 shows the structures of the different pcm schemes. Note that 
the combination of controlled amplifier and fixed quantizer can be re- 
placed by a quantizer with a step-size adaptation. Matching the gain 
of the amplifier to signal variance results in modifying the PpF of the 
signal to be quantized. It has been shown that different density func- 
tions can be reached by choosing an appropriate forward estimation 
scheme.* To get the best s/n performance, those quantizers can be 
employed that are optimum for the specific PpF. 


Table |— Comparison of the smallest reconstruction levels r; 
of different optimum unit variance three-bit quantizers 


Nonuniform Uniform 
Type of Quantizer Quantizer Quantizer 
Ty T1 
Uniform ppF quantizer — 0.217 
Gauss quantizer 0.245 0.293 
Laplace quantizer 0.222 0.366 
Gamma quantizer 0.149 0.398 
Logarithmic 4100 quantizer 0.062 = 
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2.2 Types of quantizers 


The following types of quantizers were used in the simulation of the 
speech-encoding systems: 


Uniform quantizer with different loading factors. 
Logarithmic quantizer with different loading factors. 


Uniform optimum Gauss quantizer. 
Uniform optimum Laplace quantizer. 
Uniform optimum gamma quantizer. 


Nonuniform optimum Gauss quantizer. 
Nonuniform optimum Laplace quantizer. 
Nonuniform optimum gamma quantizer. 


These optimum quantizers lead to a maximum s/n for the specific 
probability density functions. 


2.3 Algorithms of the AQ schemes 


In applying adaptive quantization schemes, different possibilities 
of controlling the gain of the amplifier have been used. The following 
notation has been employed for the description of the algorithms (see 
also Figs. 1 and 2): 


Symbol Explanation 

x(n) Input sample at time instant n. 

y(n) Quantized sample at time instant 7. 

= Index of quantizer step at time instant n. 

Gn Gain of the amplifier at time instant n (backward 
estimation). 

Gy Gain of the amplifier used in block N (forward estimation). 

M Number of quantized samples used for calculation of G, 

(backward estimation). 
N Number of block. 


NSEG Number of input samples used for calculation of Gy (for- 
ward estimation). 


NF Number of first sample of block N. 
NF = (N — 1)-NSEG + 1. 

ON Vector of short-term autocorrelation coefficients calculated 
from the input samples of block N. 

Ry Toeplitz matrix of short-term autocorrelation coefficients 


calculated from the input samples. 
a, 8B, a;, € Coefficients to be optimized for each algorithm. 


In all aq schemes, a local estimate of the quantizer input signal variance 
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‘ Le 
O O) 





(b) 





(Cc) 


Fig. 1—pom encoding schemes. (a) nonadaptive pcm. (b) Adaptive pcm with for- 
ward estimation (pcomM-aQqF). (c) Adaptive pcm with backward estimation (PCM-AQB). 
x(n) = input signal, y(n) = quantized signal, @ = quantizer, Gc = gain control, 
B -++ GE = buffer and gain estimation. 


is calculated ; this value determines the gain of the amplifier such that 
the quantizer is optimal loaded. 


2.3.1 Forward estimation schemes (AQF) 
In the aqF schemes, the gain is only readjusted once for a new block 
of NSEG speech samples: 


Gy = const.; n=NF,NF +1,---,NF + NSEG — 1 
NF = (N — 1)-NSEG +1. 





Gn , Gn 


Fig. 2—aq scheme. x(n) = input signal, y(n) = quantized signal, G, = gain used 
at time instant n(aqB scheme), Gy = gain used in block NV (aqr scheme). 
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The following algorithms have been used: 


(2) POM: variance scheme®: An unbiased estimation of the variance 
of the block is calculated: 


1 NSEG 
ee NE Tag), (1) 


—2 __ : 
Gy" = © eRG Fe 


Gy is proportional to the inverse of the standard deviation 
estimated from the samples of the block. 

(22) PCM: maximum scheme: The maximum amplitude in the block 
is used: 


Gy} = a-mMmax {|a(NF —]1 + 9) | } -1,-.-,NSEG: (2) 


(211) DPCM: maximum scheme’: The maximum difference between 
neighbored samples is used: 


Gy! = a-max 


{|2(NF — |] + 9) —= z(NF — 2 + j) |} ja2,++.N SEG: (3) 


This algorithm can only be used for predictors with one 
coefficient. 

(tv) ADPCM: variance scheme®®: The vector of short-term autocor- 
relation coefficients is used to calculate an estimation value of 
the variance o% of the difference signal : 


Gy? = a-o4 = a:[o2 — on Ry’ -on | (4) 


2.3.2 Backward estimation schemes (AQB) 


In aqs schemes, the gain of the amplifier is, in general, modified for 
every new input sample by a factor depending on the knowledge of the 
previous quantized samples or of the corresponding quantizer indices. 


Gn = a: G,—1: 
The following algorithms have been used: 


(2) One-word memory scheme®: The last gain value is multiplied by 
a factor that depends on the last occupied quantizer step: 


Gr = F(|Ln|) Gn. (5) 


(22) Variance scheme®": The last M quantized samples and (for 
B&B ~ 0) the last gain value are used to calculate a new gain 
value: 


M 
Ge? = Zan — 9) + B/Gr-1. (6) 
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(272) Modtfied one-word memory scheme’: The gain of the amplifier is 
changed if the smallest reconstruction level has been occupied 
a times or if the largest reconstruction level has been occupied 
once: 


2.0-Gnr1 if |Im| = min for a times (m = n, 
nm—1,-:-,;n—a+l1) 

0.5-G,-1 if vey = Max 

1.0-G,-1 otherwise. 


2.4 Fixed and adaptive predictors 


In predictive encoding systems, an estimate of each input sample is 
calculated and subtracted from the actual input sample; the difference 
is then quantized, encoded, and transmitted. The use of nonadaptive 
predictors (ppcm schemes) leads to a suboptimum overall performance 
of the encoding scheme, because the prediction is not optimum for all 
speakers and for all speech sounds.}:8 

A better prediction can be reached by using adaptive algorithms 
(ADPCM schemes). Two schemes are possible: 


(2) Forward scheme: A short-time autocorrelation function is cal- 
culated using a finite number of buffered input samples. The 
predictor coefficients are readjusted according to the time- 
variant autocorrelation function. ! 

(12) Backward scheme: The predictor is optimized using the quantized 
information (gradient search method and Kalman filter 
algorithm).!? 


Only the forward scheme has been used in the simulations. The 
optimum vector hy of J predictor coefficients for each block N is 


hy = Ry'-ov. (8) 


Ry and oy are the matrix and the vector of short-term autocorrelation 
coefficients calculated from the input samples of block N. The predictor 
coefficients have to be transmitted to the receiver, in addition to the 
code words of the quantized difference signal samples. An upper bound 
of the gain in s/n as compared to PcM is given in Section IV. 


lil. RESULTS 


Various nonadaptive and adaptive encoding schemes have been 
simulated on a digital computer. The signal-to-quantization noise 
ratio (s/n) has been determined using 8-kHz-sampled speech samples 
of one speaker. The same 2.3-s utterance (““The boy was mute about 
his task”; female voice; bandwidth 200 to 3200 Hz) has been used in 
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OUTPUT 
® 





Fig. 3—ppcm scheme. Q = quantizer. 


all simulations. (The simulations have not included any high-frequency 
emphasis of the input speech, as is characteristic of a 500-type set 
transmitter, for example.) The following schemes have been studied: 


(2) Nonadaptive Quantization 
PCM: see Fig. 1. 
DPCM (nonadaptive prediction): see Fig. 3. 
ADPCM (adaptive prediction) : see Fig. 4. 
(iz) Adaptive Quantization 
PcM: see Fig. 1. 
Forward scheme (PCM-AQF). 
Backward scheme (ecM-AQB). 
DPCM (nonadaptive prediction) : see Fig. 5. 
Forward scheme (DPCM-AQF). 
Backward scheme (DPCM-AQB). 
ADPCM (adaptive prediction) : see Fig. 6. 
Forward scheme (ADPCM-AQF). 
Backward scheme (ADPCM-AQB). 


These encoding schemes have been optimized using the s/n as criterion. 





OUTPUT 
O 


Fig. 4—aprcm scheme. B + cE = buffer and coefficients estimator. 
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Fig. 5—ppcm-aq schemes. B + GE = buffer and gain estimator, ec = gain control. 


3.1 Optimum results: three bits/sample quantization 


Figure 7 shows the optimum results reached with a three-bit quan- 
tization of the 2.3-s speech sample. 


Left curves: Optimum results using a fixed quantizer. 

Right curves: Optimum results using an adaptive quantizer. 

Lower curves: Prediction with a first-order predictor (one 
coefficient). 

Upper curves: Prediction with a high-order predictor. 


Pa ee eee 
= == — ADPCM AGF pr a 
| Ge 
= === == ADPCM-AQB i! | | 
|} | B+GE ! 
\ 4 
Q { | b —— 
GC |! 4 
L 


B+ GCE (—) >. 


PTY 


Fig. 6—aprcm-aq schemes. B + Gcr = buffer and gain and coefficients estimator. 
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Fig. 7—Signal-to-noise ratio values and gains (over logarithmic pcm) for different 
three-bit speech-encoding systems. 


Quantizers with a logarithmic characteristic have been used in all 
simulations with a fixed quantizer (curves on the left side of Fig. 7). 
The s/n value for a pcm scheme is 


s/n = 8.7 dB 


if the quantizer has a #100 characteristic, and if the loading is 40, (¢; 
is the standard deviation of the signal to be quantized). As compared 
to this s/n value of 8.7 dB, the following maximum gains can be 
reached with prediction schemes using the same type of quantizer 
(G* is the gain in s/n over pcm): 


Fixed predictor, fixed quantizer: G* 7 dB 
Adaptive predictor, fixed quantizer: G* 11 dB. 


Adaptive quantization (pcM-aq) not only has the advantage of in- 
creasing the dynamic range that the quantizer can handle, but it 
also allows the application of quantizers that are optimum for the 
probability density function of the signal to be quantized. The follow- 
ing gain over the 8.7-dB value of nonadaptive pcm has been reached: 


Adaptive quantization: G* 7 dB. 
Using predictors, the gains over PCM are now 
Fixed predictor, adaptive quantizer: G* 12 dB 
Adaptive predictor, adaptive quantizer: G* = 16 dB. 
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Fig. 8—Comparison of waveforms. r(n) = sequence of input samples (512 sam- 
ples), y(n) = sequence of decoded samples, (7) = sequence of quantization errors, 
Gy = sequence of amplifier gains. 


Figure 8 shows the waveforms of the reconstructed signal and of the 
quantization error for a 64-ms segment of speech. Three examples are 
shown: 


(z) pcM, nonadaptive, 4100 characteristic. Only eight different 
levels can be used for the reconstruction (decoding) of the 
signal. 

(22) PCM-AQF, optimum Gauss quantizer, NSEG = 32. The number 
of levels is limited to eight for each segmeni of NSEG samples. 
Different levels can be used for each segment. 

(272) ADPCM12-AQF, optimum Laplace quantizer, NSEG = 128. The 
predictive encoding with a 12th-order predictor leads to a very 
high s/n. For each segment, the number of levels of the dzf- 
ference signal is limited to eight, but the reconstructed signal 
does not suffer this limitation. 


3.2 Adaptive delta modulation 


To determine whether the quantization schemes represent an 1m- 
provement over existing adaptive delta modulation (apm) schemes, the 
s/n value of Jayant’s apM-scheme!? has been determined at a bit rate 


SPEECH-ENCODING QUANTIZATION 1607 


of 24 kb/s. The s/n value is approximately 15 dB.’ Therefore, the gain 
over nonadaptive three bits/sample Pc is 


Adaptive delta modulation: G* & 6 dB. 


3.3 Entropy coding 


Entropy coding is a variable-length coding procedure that assigns 
short code words to highly probable symbols and longer code words to 
less probable symbols. The average word length 1s approximately equal 
to the entropy of the quantizer output signal. The entropy coding 
technique leads to an additional gain in s/n for a given average bit 
rate. The number of quantizer steps can be increased without exceed- 
ing an average bit rate of three bits per sample. The dashed lines in 
Fig. 7 show the s/n values that can be reached by using an entropy 
coding technique. In this case, uniform quantizers with a large number 
of steps have been employed; the step sizes have been adjusted to 
give a quantizer output entropy of three bits. It should be noted that 
a buffer is needed so that the variable-length coded signal can be 
transmitted over a channel at a uniform bit rate. 


3.4 Optimum results: two bits/sample up to five bits/sample quantization 


Figure 9 shows the s/n values for quantizations with two bits/sample 
up to five bits/sample (corresponding to bit rates from 16 kb/s up to 
40 kb/s). The following encoding schemes have been compared: 


PCM u100 characteristic, 8c, loading. 

PCM-AQF NSEG = 32, optimum Gauss quantizer. 

DPCM 1-AQB 1 predictor-coefficient, fixed; optimum Gauss 
quantizer. 


ADPCM1-aQ@F 1s predictor-coefficient, adaptive; optimum Gauss 
quantizer; NSHG = 32. 

ADPCM4-AQF 4 predictor-coefficients, adaptive; optimum Laplace 
quantizer; NSEG = 128. 

ADPCM12-AQF_ 12 predictor-coefficients, adaptive; optimum gamma 
quantizer; NSHG = 256. 


3.5 Parameter transmission in adaptive encoding schemes 


In all forward schemes, channel capacity is needed for transmission 
of the adaptive parameters. The problems and techniques of quantizing 
these parameters are not considered in this paper. It is known that the 
parameters tolerate coarse quantization and slow updating. If neces- 
sary, redundancy-reducing schemes can be used to lower the number of 
bits that have to be transmitted in addition to the encoded speech 
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s/n IN dB 





PCM PCM— DPCMi— ADPCMi— ADPCM4— ADPCM12-— 
AQF AQB AQF AQF AQF 


Fig. 9—Signal-to-noise ratio values for quantization with two bits per sample 
(16 kb/s) up to five bits per sample (40 kb/s). 


samples. The needed channel capacity can be approximately trans- 
formed into an equivalent loss in s/n. If each parameter of the adaptive 
scheme has to be encoded with NADD bits/segment, and if NSHG 
is the number of samples/segment, then we get an equivalent reduction 
in s/n performance: 


Asin = 6.02 NADD (bits/segment) 


~~ NSEG (samples/segment) oP): (9) 


This loss is due to the reduction of the number of quantizer steps in 
order not to exceed the maximum allowed bit rate. 


Example: 
NSEG = 128 (16 ms) 
NADD = 4 bit. 


The loss is 0.2 dB for each coefficient to be transmitted. 
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IV. UPPER BOUNDS FOR PREDICTION 


The linear dependencies between the amplitudes of the speech 
sample being used in all simulations have been calculated to get a 
measure of the maximum gain that can be reached with linear predic- 
tion. Note that these upper bounds of the prediction gain cannot be 
reached with predictive encoding systems (especially if the quantizer 
has only a low number of quantization levels), because prediction is 
done then with decoded speech samples. These samples include a 
quantization error. 


4.1 Nonadaptive prediction 


The long-term autocorrelation function of the speech signal has been 
measured. Figure 10 shows the first 19 time lags of the normalized 
autocorrelation function p(n). Using these data, a predictor can be 
optimized such that the variance of the difference signal is minimum. 

The prediction gain is the ratio of the variances of the input signal 
and the difference signal: 


IT nad 2 

Gp = 10 logo ape = 10 log io =a (10) 
Gp can be calculated directly from the normalized autocorrelation 
function p(n). Figure 11 shows this gain versus the number of coeff- 
cients being used for the prediction. The maximum prediction gain is 
approximately 10.5 dB. This value is an upper bound of the additional 
gain in s/n over pcm by using nonadaptive differential encoding 
schemes. This gain cannot be reached if the ppcm encoder has to handle 
speech samples of different speakers. In this case, suboptimum pre- 
dictor coefficients have to be chosen such that the ppcm encoder has 


2 4 6 8 10 12 14 16 18 20 
TIME LAG n 


Fig. 10—Normalized autocorrelation function (female voice; 200 to 3200 Hz). 
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Fig. 11—Prediction gains vs number of predictor coefficients. 


a good performance for all speakers. This demand can only be fulfilled 
with predictors of low order (up to three coefficients). It may be 
relevant to mention that such suboptimum predictor coefficients have 
been used in the simulation of the ppcm schemes. 

Knowing the long-term autocorrelation function p(n), it is possible 
to calculate an approximation of the power density function. This is 
done by calculating the power transfer function of a recursive filter, 
the coefficients of which are equivalent to the coefficients of the opti- 
mum predictor (maximum-entropy method"). Figure 12 shows the 
power density spectrum calculated in this way from 16 coefficients of 
the autocorrelation function p(n). 


4.2 Adaptive prediction 


NSEG samples of the input samples are buffered, and the short-term 
autocorrelation function of this segment is calculated. For each seg- 
ment of NSEG samples, the variance of the difference signal can be 
calculated directly from this short-term autocorrelation function [see 
eq. (4) |. Using these variances, a prediction gain can be determined for 
different numbers of predictor coefficients and for different values of 
NSEG. Figure 11 shows the optimum prediction gain for an adaptive 
prediction scheme versus the number of predictor coefficients. In each 
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Fig. 12—Power density of the speech signal. 


case, the optimum value NSEG has been used. The maximum predic- 
tion gain is approximately 14 dB. This value is an upper bound of the 
additional gain in s/n over pcm by using adaptive differential encoding 
schemes. 


V. UPPER BOUNDS FOR QUANTIZATION 


It is possible to design quantizers such that the signal-to-quantizing- 
noise ratio is a maximum; this is done by choosing the quantizer step 
sizes according to the probability density function of the signal. It is 
known that these optimum quantizers cannot be used for the quantiza- 
tion of speech signals: the s/n improvement is offset by the greater idle 
channel noise and smaller dynamic range (Ref. 4; see also Section 
2.1 above). Optimum quantization is practical, however, if used in 
an adaptive quantization scheme; it gives us an s/n advantage over 
logarithmic quantization, and it allows a further increase in s/n by 
using entropy coding techniques (variable length coding). The adaptive 
quantization technique changes the ppF of the signal to be quantized ; 
it has been shown that different density functions can be reached with 
the forward estimation scheme (aar scheme). Table II shows the s/n 
values for three-bit quantizers without and with entropy coding. The 
values of the first two columns are taken from Max" and Paez and 
Glisson.!® In the case of entropy coding, the quantizers have been 
optimized so that the s/n is maximum for the given average bit rate 
of three bits per sample.!* It is not possible to get higher s/n values with 
any encoding scheme based on memoryless single-letter quantization. 
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Table I|— Maximum s/n values of various three-bit quantizers 


Quantizer Without Entropy Coding 
Quantizer With 


Optimum Optimum Entropy Coding 

Uniform Nonuniform 

Quantizer Quantizer 

s/n(dB) s/n(dB) s/n(dB) 
Uniform ppFr 18.06 (18.06) 18.06 
Gaussian PDF 14.27 14.62 16.53 
Laplace PpF 11.44 12.61 17.09 
Gamma PDF 8.78 11.47 18.78 


VI. CONCLUSIONS 


Comparisons of various nonadaptive and adaptive three-bit speech- 
encoding systems via simulation with speech inputs show that a wide 
range of signal-to-quantization-noise ratios can be reached starting 
with 9 dB (logarithmic pcm) and increasing up to 27 dB (adaptive 
predictive coding with adaptive quantization and entropy coding). 
Adaptive quantization has an s/n advantage of 7 or 5 db over logarith- 
mic PCM when used in encoding schemes without and with prediction, 
respectively. Nonadaptive prediction leads to a 7-dB increase in s/n, 
and 11 dB can be gained using adaptive prediction techniques. Entropy 
coding gives an additional 2 to 3 dB improvement; such a coding 
technique is difficult to implement if a constant bit rate has to be 
achieved, but it may be of interest for asynchronous data networks. 
Furthermore, subjectively, ppcm gains over logarithmic PCM are 
believed to be greater than what the s/n gains suggest. 

Informal listening tests have shown that all predictive encoding 
schemes give a very good speech quality when used in connection with 
adaptive quantization (DPCM-AQq or ADPcM-aQ). Differences between 
the original speech and the decoded speech are not audible with adap- 
tive prediction schemes when a high-order predictor is used (for 
example, ADPCM4-AQF). 

The upper bounds that have been determined separately for the 
prediction gains and the quantizer s/n performances cannot be reached 
in practical predictive encoding systems. This fact is attributed to the 
predictor-quantizer interaction; that is, the input to the predictor 1s 
a noisy version of the input signal, and the input to the quantizer is a 
noisy prediction error. This interaction is not negligible when three-bit 
quantizers are used. 

It is important to realize that all results are based on a single speech 
record of one speaker. Computer simulations using other speech 
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material show basically similar results; the main differences appear in 
the exact prediction gains that can be reached. In many instances, 
these gains are higher than those mentioned in this paper. 

One object of this paper was to quantify the (relative and absolute) 
capabilities of a wide range of nonpitch-tracking speech coders. The 
coders studied have a variety of potential applications that call for 
different specifications of speech quality and coder complexity. A 
second purpose of this paper was to study the capabilities of three-bit 
encoding in some detail, as motivated by mobile telephone studies.17:!8 
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The signal-to-noise ratios of different speech-encoding schemes have 
been measured in the case where the channel contains errors. Those types 
and probabilities of errors have been considered that are of interest for 
mobile telephone applications. Most encoding schemes use an adaptive 
three-bit quantizer with an explicit transmission of the step-size informa- 
tion. A scheme with an adaptive prediction algorithm has also been studied. 
It has been assumed tn all cases that the side information about the quan- 
tizer step size and the predictor coefficients 1s transmitted in an error- 
protected format. 

Measurements were made by simulating the coding schemes and the 
noisy channel on a digital computer. The results include upper bounds of 
the umprovements that can be reached with error protection of the most 
significant bits. 


l. INTRODUCTION 


The suitability of digital coding and transmitting speech signals 
for mobile telephone systems is a question of current interest. In UHF 
systems, Rayleigh fading causes the carrier-to-interference ratio and 
the carrier-to-noise ratio to be low in frequent intervals. This leads 
to high bit-error probabilities in the transmission of the coded signal; 
the errors occur in bursts. 

This paper compares the effects of channel transmission errors on 
the objective signal-to-noise ratio (s/n) for different encoding schemes. 
Most of these schemes use an adaptive quantizer with time-varying 
step sizes or, equivalently, a time-varying gain control of an amplifier 
in front of a quantizer with fixed step sizes. The side information about 
the step size (or about the amplifier gain) is derived from stored 
samples of the input signal and has to be transmitted together with the 
message block of coded samples (adaptive quantization with forward 
estimation = aqFr). These aaF schemes have an excellent idle channel 
performance, even in the presence of channel errors, if the side informa- 
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tion can be transmitted in an error-protected format. Previous 
studies! * always assumed a nonadaptive y-logarithmic quantizer with 
its specific problems of allowable peak clipping and changing per- 
formance caused by different mean levels of the speech signal. The 
purpose of the present study is to show the s/n performance of some 
adaptive speech-encoding schemes suitable for mobile telephone ap- 
plications. Our measurements were made by simulating the coding 
schemes and the noisy channels on a digital computer. The measure- 
ments include upper bounds for the s/n that can be reached by using 
error-protection schemes to reduce the effective channel error proba- 
bility. We have studied independent, as well as clustered, channel error 
patterns. No attempt was made to study practical error-detection 
or error-correction schemes. However, the results allow us to predict 
the overall performance that can be reached with nonideal error- 
correction schemes. The s/n values of the coding systems with inde- 
pendently distributed channel errors were measured because it is 
possible to produce a nearly equivalent error pattern by scrambling 
the bit stream at the output of the transmitter.® Additionally, inde- 
pendent errors will be the primary impairment if the interference is low. 

The organization of this paper is as follows. Section II discusses the 
dependence of the total s/n on the bit-error rate P, assuming gaussian- 
distributed quantizer input data. It has been shown‘ that the gaussian 
probability density function is a good approximation for signals oc- 
curring in speech aqgFr schemes. Both the natural binary code (NBC) 
and the folded (symmetrical) binary code (Fsc) are considered. It is 
shown that the FBc code has a better s/n performance if channel 
transmission errors cannot be ignored. Section III discusses those 
speech-encoding systems used in this study, and Section IV considers 
the types of errors on the channel and gives the main results obtained 
by simulating the encoding schemes and the noisy channels on a 
digital computer. Comparison is made on the basis of the objective 
overall signal-to-noise ratio. Tape-recorded examples were used to 
compare the subjective and perceptual effects of signal-quantization 
and channel errors. Some conclusions are given in Section V. 

A detailed comparison of various speech-encoding schemes on the 
assumption of an error-free transmission 1s described in another paper.° 
A companion paper by Jayant® gives numerous examples of the per- 
formance of practical error-protecting schemes in the presence of 
channel errors. 


Il. DERIVATION OF THE SIGNAL-TO-NOISE RATIO FOR GAUSSIAN SIGNALS 


We calculate the overall signal-to-noise ratio of a three-bit Pcm- 
quantization scheme on the assumption that a speech signal can be 
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Fig. 1—Digital transmission system. 
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represented by a gaussian probability density function (ppF) at the 
quantizer input. This assumption is approximately valid if adaptive 
quantization with forward estimation is applied to the speech samples 
and if the message blocks are not too long.* Figure 1 shows a block 
diagram of the system under consideration. The quantizer with M = 8 
steps (and m = logs M bits per code word) maps each input sample 2(n) 
into one of a set of eight rational numbers y(n)e{vz,};=1,2,..-,a- Lhe 
representation level v; is chosen if wiz1 2 x(n) > uz, as illustrated in 
Fig. 2. The index 7 of the input symbol v; of the transmission system 
is transmitted to the receiver in a binary format [binary code word 
y-(n) ]; the received code word 2,(n) is interpreted as one of the eight 
output symbols z(n)e{ wz}z—1,2,...,4. We obtain a change 6,;; = |v; — w;| 
in amplitude if the transmitted quantizer index 7 is changed to 7 
because of channel errors (Fig. 3). The total mean-squared error is 





Fig. 2—Symmetric nonuniform quantizer.m = 3bits, M = 8 steps, NBc = natural 
binary code, rpc = folded binary code. 
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Fig. 3—Channel transmission errors. 
given by 
¢ = E[x(n) — 2(n) P 
= € a €%) (1) 
where the quantization error e? and the channel error «2 are given by 
eg = ELx(n) — y(n) (2) 
ec = ELy(n) — 2(n) J. (3) 


Equation (1) is only true on the assumption of a vanishing correlation 
between quantization error and channel error;’ this is the case if the 
quantizer structure is that of Max® (these quantizers lead to a maxi- 
mum s/n performance for a given PDF). 

The mean-squared error caused by digital line errors is 


M M 
& = DY Pv, wy) 6%. (4) 
i=] j=l 
With 
P(v;, wi) = P(v,)-P(w;/v,), (5) 
we have 
MM 
= 2. zy P (v,;) +P (w;/v;) 87 
t=1 j= 
= trace {P,-P,-8?}, (6) 
where 
P(v,, w;) is the joint probability of an input symbol »; at the 


transmitter and an output symbol w; at the receiver, 
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6;; = |v; — w;| is the amplitude of the error occurring if the input 
symbol v; has been chosen at the transmitter and if 
the output symbol w; has been interpreted at the 


receiver, 

P (v;) is the probability of input symbol v; occurring, 

P(w,;/v;) is the conditional probability that the output symbol 
w; will be received if the input symbol »v; 1s sent, 

P, is a diagonal matrix with elements P (v,), 

P, is the channel transition matrix with elements 
P(w;/v:), and 

3? is the matrix of squared error amplitudes with ele- 
ments 6;;. 


The conditional probabilities P(w;/v;) can be calculated easily if the 
bit errors on the channel are distributed independently. The values 
depend on the bit-error probability P and on the code. The probability 
P(w;/v;) that w; will be received if v; 1s sent 1s just the probability that 
digital errors will occur in the D places where they differ and that no 
errors will occur in the m — D remaining places, 


P(w;/v;) = PP? — P)yr?, (7) 


where P is the bit-error probability on the channel, D is the Hamming 
distance between the code words representing the symbols v; and wy, 
and m is the number of bits per code word. The Hamming distance D 
depends on the code; two codes have been considered (see Fig. 2): 
(2) the natural binary code (NBc), (77) the folded binary code (FBC). 
For this code, the most significant bit gives polarity information; the 
remaining bits represent the signal magnitude in natural binary code. 
For example, the transition from input symbol v; to output symbol 
Wg = Vg Causes an error 61g in amplitude. The Hamming distances are 
D = 3 and D = 1 for the NBc code and FBC code, respectively (see 
Fig. 2). Using (7) we get P(ws/v1) = P® with the nsc code and P (ws/v1) 
= P(1 — P)? with the Fsc code. 

The mean-squared error caused by digital line errors can also be 
calculated if (ideal) error-protection schemes are applied: 


Scheme EP1: The most significant bit of each code word 1s perfectly 
error-protected. 

Scheme EP2: The two most significant bits are perfectly error- 
protected. 


The channel transition matrix P, has to be modified in these cases 
because some elements of the matrix are zero then. These necessary 
modifications are not described in this paper. Using (1) and (6), we 
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obtain the signal-to-noise ratio 


2 


a2 Or 
s/n = 10-logie = 10-logw ace (P,P 


where 
oz = Ela?(n)] (9) 


is the mean-squared power of the input signal. The normalized 
quantization noise variance e7/o% has a value of 0.03451 if the structure 
of the three-bit quantizer is that of Max.® Using this value, the s/n per- 
formance has been calculated as a function of the bit-error rate P 
(Fig. 4). The two lower curves show that the rBc code outperforms the 
nBc code. The folded binary code has therefore been used in all simula- 
tions. The upper curves demonstrate the advantage of an (ideal) 
error protection of the most significant bit (mP1) and of the two most 
significant bits (eP2). It may be relevant to mention that this error 
protection leads to a reduction of the effective bit-error rate. To con- 
firm the theoretical results, we have measured some s/n values with 
the simulation program that has been used for the study of the speech- 
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Fig. 4—Signal-to-noise ratio performance of a gaussian data transmission system. 
NBC = natural binary code, rBc = folded binary code, Ep1 = error protection of 
most significant bit, Ep2 = error protection of two most significant bits. 
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Table |— Channel coefficients of optimum three-bit Gauss 
quantizers. NBC = natural binary code, 
FBC = folded binary code. 


al a2 3 
Optimum nonuniform quantizer 
NBC 6.9124 —2.7373 —0.3133 
FBC 5.7445 —0.4015 —0.3133 
Optimum uniform quantizer 
NBC 7.2113 —3.3611 0. 
FBC 5.5672 — 0.0128 0. 


encoding systems. The results obtained with gaussian input data and 
the FBc code compare favorably with those determined using (8) (see 
Fig. 4). 

The channel error variance é as given in (6) can be transformed into 


eS a ePs (10) 
j=l 


The coefficients a; contain the total information about the effects of 
channel errors on the performance of an encoding-decoding scheme. 
Thus, different schemes can be compared easily. Table I lists the 
a; coefficients of optimum uniform and nonuniform gaussian three-bit 
quantizers. Note that there are only very small differences between 
nonuniform and uniform quantizers. Note also that the rsBc code 
should be chosen; it has a nearly 1-dB advantage over the NBc code 
if the bit-error rate is very high (see also Fig. 4). 

In the discussions so far, a quantizer has been assumed that is 
optimum (in the sense of a maximum s/n) if the channel is error-free. 
A higher overall s/n performance can be reached, however, if the 
quantizer is reoptimized for a given channel transition matrix P,.° 
Figure 5 shows an example with a three-bit quantizer optimum for a bit- 
error rate of 0.025 and the NBc code. In fact, we get a better s/n per- 
formance if the bit-error rates P are high, but the shortcoming is the 
decrease in s/n for low P values; this reduction is approximately 2.3 
dB in the error-free case (P = 0). The reoptimization of the quantizer 
can therefore only be of interest if the channel noise statistics are time- 
invariant, or if the s/n decrease for small error rates can be accepted. 
This will be the case if a quantizer with a higher number of step sizes 1s 
chosen. 

The principal aim of this section was to show the calculation of the 
s/n of a (nonadaptive) pcm scheme if channel errors are present. These 
calculations can be extended to ppcm systems and to burst error 
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Fig. 5—Signal-to-noise ratio performance of two nonuniform gaussian three-bit 
quantizers. (a) Quantizer is optimized for an error-free transmission. (b) Quantizer 
is optimized for independent channel errors of rate P = 0.025. npc code has been 
used in both cases. 


channels. But it seems very difficult to find theoretical solutions for 
coding schemes that apply adaptive quantization and adaptive pre- 
diction strategies. Instead of looking for such solutions, we simulated 
different encoding schemes and channels on a digital computer and 
measured the overall s/n. We shall find some similarities to the results 
we obtained in this section. Particularly, we shall use the channel 
coefficients a; to compare the performance of the coding schemes. 


lil. DESCRIPTION OF THE CODING SYSTEMS 


To get a good quality of the coded speech with low bit rates, we have 
used pcm and differential pcm schemes that employ adaptive quantizers. 
Both nonadaptive prediction (ppcM) and adaptive prediction (ADPCM) 
have been applied. The advantage of adaptive quantization is that 
the quantizer is always adjusted to the highly variable variance of the 
speech signals. Thus, a better s/n performance is achieved.*!° ppcom 
and ADPcM schemes provide an additional s/n gain over pcm; this is 
especially true if the predictor responds to changes in the short-term 
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spectrum of speech (appcm).!" A nonadaptive logarithmic companded 
pcm has been included in our study because it very often serves as a 
standard reference in coder comparisons. A great number of speech- 
encoding schemes have been compared in a companion paper® on the 
basis of s/n as performance measure using the same speech signal 
employed in this paper. The effect of channel errors on s/n performance 
has been studied using the following encoding schemes: 


Scheme 1: PCM, nonadaptwe (Fig. Ga). The quantizer has a »100 
characteristic,’? and the loading is four times the standard deviation 
of the speech signal to be quantized. 

Scheme 2: PCM-AQF (Fig. 6b). Thirty-two samples of the input signal 
are buffered, and the maximum value of this block determines the 
gain of the amplifier in front of the quantizer (adaptive quantization 
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$ B+GE > O 
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Fig. 6—Speech-encoding schemes. (a) Nonadaptive pcm. (b) pcm with adaptive 
quantization (PcM-aQ@Fr). (c) DpcM and adaptive ppcm (appcM) with adaptive quantiza- 
tion (DPCM-AQF and ADPCM-AQF). Q = quantizer, Gc = gain control, B+ GCE 
= buffer and gain and coefficients estimation. 
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with forward estimation = aqFr). The characteristic of the quantizer 
is optimum for signals with a gaussian probability density. 

Scheme 3: DPCM1-AQF (Fig. 6c). A predictor with one time-invariant 
coefficient is being used in connection with adaptive (forward estima- 
tion) quantization (A@F). Thirty-two samples of the input signal are 
buffered, and the maximum difference between neighbored samples 
determines the gain of the amplifier in front of the quantizer. The 
characteristic of the quantizer is optimum for signals with a gaussian 
probability density. The predictor coefficient that is optimum for the 
speech signal being used is hy = 0.85. Lower values lead to a better 
performance of the ppcm scheme in the case of high error probabilities ; 
this will be shown in Section V. 

Scheme 4: ADPCM1-AQF (F 1g. 6c). In this adaptive prediction scheme, 
32 samples of the input signal are buffered, the normalized short-term 
correlation coefficient between neighbored samples of this block is cal- 
culated, and the predictor coefficient is set to this correlation coefficient. 
The gain of the amplifier in front of the quantizer is determined by 
calculating an estimation value of the standard deviation of the differ- 
ence signal; the amplifier gain is set to the inverse of this estimation 
value. The characteristic of the quantizer is optimum for signals with 
a gaussian probability density. 

Scheme 5: ADPCM4-AQF (F 1g. 6c). In this adaptive prediction scheme, 
four optimum predictor coefficients are calculated for each segment of 
32 samples from the first values of the short-term autocorrelation 
function; see the description of Scheme 4 for further details. 


The folded binary code (FBc) was used in all cases. It should be 
mentioned that the combination of controlled amplifier and fixed 
quantizer in the adaptive quantization schemes is equivalent to a 
quantizer with a step-size adaptation. Some adaptive quantization 
schemes that use the transmitted code words for the control of the 
amplifier gain (adaptive backward estimation) have also been studied. 
The simulations have shown that these schemes cannot be used for 
channels with high bit-error probabilities; the overall s/n turned out 
to be less than 0 dB in most cases. 


IV. ERROR PERFORMANCE: RESULTS AND DISCUSSION 
4.1 Simulation system and types of errors 


The dependence of the overall signal-to-noise ratios of five speech 
coding schemes (see Section IiI) on the average bit-error probability 
P has been determined for different types of noisy channels. The s/n 
is given by 


2 
s/n(P) = 10 logio “3 (11) 
t 
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where ¢? and o? are defined in (1) and (9), respectively. The s/n values 
have been measured for bit-error rates of 0, 0.001, 0.0125, 0.025, and 
0.05. The measurements were made by simulating the coding schemes 
and the noisy channels on a digital computer. Channels with inde- 
pendent, as well as correlated, error patterns have been studied. The 
statistically independent errors have been generated by using a pseudo- 
random noise generator program. Tape recordings with error patterns 
of actual fading signals have been used in the channel simulation of 
burst errors. The error patterns are typical for UnF mobile radio trans- 
mission. The statistics of these errors are described in Ref. 6. In all 
simulations, it has been assumed that it is possible to transmit the 
information about the gain of the amplifier (Aer scheme) and/or about 
the predictor coefficients (adaptive prediction) without any error. 
Increased signal-to-noise ratios can be reached for a given P value 
using error-detection and error-correction schemes. In studying these 
error-protected cases, it has been assumed that all errors are corrected. 
Practical schemes will not always be able to correct all errors. There- 
fore, the s/n values given in this paper for the error-protected case 
represent an upper bound on the performances of error-protecting 
techniques. Two types of error correction have been studied: 


EP1: Protection of the most significant bit; this bit is the sign bit. 

EP2: Protection of the two most significant bits. Only changes to 
neighbored output symbols are possible in this case (if the 
quantizer has eight step sizes). 


A 2.3-second utterance (‘‘the boy was mute about his task’’; female 
voice; bandwidth 200 to 3200 Hz; sampling rate 8 kHz) has been used 
in all simulations. 


4.2 Results 


The s/n performances of the coding schemes that have been de- 
scribed in Section III have been measured using three-bit quantizers 
and the folded binary code. Figures 7 to 11 show the measured depend- 
ence of the s/n on the average bit-error rate P in the case of burst errors. 
The lower, middle, and upper curves refer to the unprotected trans- 
mission and to the Ep1 and Ep2 error-protection schemes; note that 
the effective bit-error rate is reduced then. We show this using Fig. 9 
as an example. The s/n value for P = 0 is due to the quantization 
noise only. The lower curve shows a considerable loss in s/n for high 
bit-error rates P. An increase in s/n can be obtained by protecting the 
most significant bit (eP1; middle curve) or the two most significant 
bits (eP2; upper curve). For example, if P = 0.025, the s/n value 
without error protection is 10.4 dB. A value of 14 dB is obtained with 
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Fig. 7—Signal-to-noise ratio performance of a three-bit log pcm scheme in the 
presence of correlated errors. 


the protection of the most significant bit (mp1). The effective bit-error 
rate is reduced to 2 X 0.025 = 0.0167 in this case because 4 of the 
errors are assumed to be corrected at the receiver. The 14-dB value of 
the EP1 curve is 2 dB higher than the s/n value we get for P = 0.0167 
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Fig. 8—Signal-to-noise ratio performance of a three-bit pcM-agr scheme in the 
presence of correlated errors. 


1626 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 


s/n IN dB 





0 0.01 0.02 0.03 0.04 0.05 
BIT ERROR RATE, P 


Fig. 9—Signal-to-noise ratio performance of a three-bit DPcM1-A@F scheme in the 
presence of correlated errors. The predictor coefficient has a value hi = 0.6. 


on the lower curve (no error protection). We expect this result because 
no errors occur on the most significant bit for the effective 0.0167 bit- 
error rate of the EP1 curve. An s/n value of 16.9 dB is obtained with the 
protection of the two most significant bits (ep2). The s/n value for 
the effective bit-error rate of 4 X 0.025 = 0.083 is 14.3 dB if the errors 
occur on all bits of the code words (lower curve); therefore, a 2.6-dB 
increase In s/n is due to the fact that only the least significant bits are 
affected. 

Error protection, however, is only possible by inserting redundancy 
into the code words. Let us assume that it is possible to obtain an 
error protection of the two most significant bits by using three re- 
dundant bits for each three-bit code word. The total bit rate is now 6 
bits per sample. Let us further assume that doubling the transmission 
rate causes doubling the bit-error rate (this is true for phase-modulation 
systems). Again using Fig. 9, we find an s/n value of 15.2 dB for 
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Fig. 10—Signal-to-noise ratio performance of a three-bit appcm1-aqF scheme in the 
presence of correlated errors. 


P = 0.05 (ep2 curve). On the other hand, the s/n value without error 
protection is 10.4 dB for P = 0.025. Therefore, an improvement of 
nearly 5 dB over the transmission without error protection has been 
obtained. A similar discussion using the EP1 values shows that we get 
only a small s/n advantage then: an error protection of the sign bit is 
not sufficient for improving the overall performance. 

To better compare the performances of the coding schemes in the 
presence of errors, we have plotted the s/n values of these schemes with 
P as a parameter (Fig. 12). The s/n values for P = 0 are due to the 
quantization errors only; the increase in s/n as compared to log-pcm 
starts with 7 dB (pcm-a@F) and goes up to 14 dB (appcm4-aqF). At 
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s/n IN dB 





0 0.01 0.02 0.03 0.04 0.05 
BIT ERROR RATE, P 


Fig. 11—Signal-to-noise ratio performance of a three-bit appcM4-aqrFr scheme in the 
presence of correlated errors. 


higher bit-error rates, the different coding schemes have approximately 
identical performance because the output noise due to channel errors 
predominates over the quantization noise due to the quantizer. 
Recall from (1) that the total error variance can be expressed as the 
sum of the quantization error variance é? and the channel error variance 
é. if the mutual error is neglected. The term ¢«; can be determined from 
the s/n for P = 0; hence, we can separate the values ¢«% for the four 
bit-error rates P that have been used in the simulations; these are the 
values P = 0.001, 0.0125, 0.025, and 0.05. The channel error can be 
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s/n IN dB 





PCM PCM— DPCMI— ADPCMI— ADPCM4— 
AQF AQF AQF AQF 
hy = 0.6 


Fig. 12—A comparison of the s/n performance of three-bit encoding schemes at 
different bit-error rates P (correlated errors). 


expressed approximately as 
@&=a,P + ak (12) 


because the third term of (10) can be neglected. 

The coefficients a; and a, have been determined using the measured 
data by searching for the minimum of the mean-squared differences 
between measured and calculated s/n values. These coefficients a; and 
a2 describe the effect of channel errors on the performance of the coding 
schemes. The total channel error variance is mainly determined by the 
channel error coefficient a,. Figure 13 shows that the a; values of those 
encoding schemes that use the same gaussian quantizer (all aQr 
schemes) are not very different. From this, we conclude that trans- 
mission errors are no more serious for ppcM and appcm schemes than 
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for pcM; this has already been mentioned for ppcom in Ref. 2. The 
channel error performance is better for burst errors than for inde- 
pendently distributed errors. This has partly to do with the fact that 
some bursts appear in low-level parts of the speech sample. On the 
other hand, we cannot neglect the a2 term in the case of burst errors; 
a2 1s the coefficient of the P? term in (12); this term mainly represents 
the channel error contribution caused by two bit errors in a code word. 


TRANSMISSION WITHOUT ERROR-—PROTECTION 
as PROTECTION OF THE SIGN-BIT (EP1—SCHEME) 





PCM PCM— DPCMi— DPCMi— ADPCMi—~ ADPCM4— 
AQF AQF AQF AQF AQF 
hy = 0.85 hy = 0.6 


Fig. 13—Channel coefficients a: and ag. J = independent errors, B = burst errors. 
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This contribution is higher in the case of burst errors; it causes a 
stronger decrease in signal-to-noise ratio as can be seen in Fig. 14 
showing the s/n performance of a ppcM1-AQF scheme both for inde- 
pendent and correlated errors. We find the same tendency if we protect 
the most significant bit (eP1; see Fig. 13) or the two most significant 
bits. We have used the average a; values to calculate the s/n increase 
if we apply (perfect) error protection: the increases are approximately 
3 and 11 dB for the mp1 scheme and Ep2 scheme, respectively. Note, 
from Fig. 13, that the a, term can be neglected in the case of Pom 
schemes; we therefore have a slower decrease in s/n at high bit-error 
rates. This can also be seen from Fig. 15, where we compare the s/n 
performance of a PCM-AQF scheme with ppcm1-aqF schemes that have 
different values of the predictor coefficient. Lowering this value, we 
obtain a higher channel error resistance, but PcM-AQF outperforms the 


s/n IN dB 





0 0.01 0.02 0.03 0.04 0.05 
BIT ERROR RATE, P 


Fig. 14—Signal-to-noise ratio performance of a three-bit ppcmi-agr scheme in the 
presence of independent errors (J) and burst errors (B). The value of the predictor 
coefficient is hi = 0.6. 
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Fig. 15—-Comparison of the s/n performance of ppcM1-AQFr schemes with different 
values h, of the predictor coefficient with a pcM-aQF scheme. 


DPCM1-a@F scheme if the predictor coefficient is too low (note the 
different slopes of the curves at high bit-error rates). 

Our simulations involved not only three-bit quantization but also 
quantization schemes with a greater number of step sizes. Figure 16 
illustrates a typical example: the signal-to-noise ratios of the four-bit 
quantization schemes are nearly 6 dB higher than the signal-to-noise 
ratios of the corresponding three-bit quantization schemes if the 
channel is error-free. But this increase is lost in the presence of high 
channel error rates: all three-bit and four-bit systems have a similar 
s/n performance for high bit-error rates. Higher s/n values can only 
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s/n IN dB 





PCM— PCM— DPCM1— ADPCM1— ADPCM4— 
AQF AQF AQF AQF AQF 
h, =0.6 


Fig. 16—-Comparison of three- and four-bit encoding schemes. 


be reached by error protection, that is, by reducing the effective bit- 
error rate. 


V. CONCLUSIONS 


In this paper, we have determined the s/n performance of various 
speech-encoding schemes in the presence of high bit-error rates (up 
to 5 percent); both independent and correlated error patterns have 
been used. Some conclusions can be drawn from the quantitative results 
that have been obtained and from informal subjective listening tests. 


(z) It is possible, without pitch tracking, to quantize speech signals 
with three bits per sample such that the decoded signal is nearly indis- 
tinguishable from the original signal (adaptive prediction schemes in 
connection with adaptive quantization). A simple scheme with a fixed 
predictor (one coefficient) and an adaptive quantization can be chosen 
for a bit rate of four bits per sample. 

(22) Adaptive quantization lowers the idle channel noise and in- 
creases the s/n (the three-bit quantizer with a logarithmic character- 
istic has a very poor performance). 
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(222) Only the adaptive quantization schemes (AQF schemes) with 
an explicit error-protected transmission of the step-size information 
can be used in the case of high channel error probabilities. 

(7v) The folded binary code (FBc) outperforms the natural binary 
code (nsC) for large bit-error rates. 

(v) Burst errors cause a stronger decrease in signal-to-noise ratio 
for large bit-error rates than independent errors. 

(vt) Transmission errors are no more serious for DpcM and ADPCM 
schemes than for PcM. 

(viz) All coding schemes show approximately the same s/n per- 
formance for high bit-error rates, because the contribution of the noisy 
channel to the total error is much higher than the contribution of the 
quantizer. Therefore, a better s/n performance can only be reached 
by using error-protection schemes, not by increasing the number of 
quantizer steps. 

(viz) A high-quality decoded signal can be obtained with a protec- 
tion of the two most significant bits. An improvement in decoded 
signal quality can be realized even if a doubling of the bit-error rate 
(caused by the higher transmission rate) has to be tolerated. 

(iz) Significant-bit-packed codes that provide only protection of the 
sign bit (EP1 scheme) are not very efficient. 

(cz) Adaptive quantization schemes suppress the idle channel noise; 
therefore, channel errors produce decoded noise only with very small 
amplitudes in silent intervals. This fact makes the decoded speech 
perceptually more pleasing. 

(xt) Nonadaptive and adaptive prediction schemes have a better 
perceptual quality than pcm schemes when bit errors occur on the 
channel. The power density spectrum of the error sequence is shaped 
in the ppcm or appcm feedback loop such that the main contribution 
of the error is in the low-frequency range. This error spectrum is 
perceptually less objectionable. 


It is important to realize that the numerical results of this paper are 
based on a single speech record of one speaker. However, we expect the 
broad conclusions of this paper, as summarized above, to be true of a 
wide range of input speech material. 
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A Diffusion Model Approximation for the 
GI/G/1 Queue in Heavy Traffic 
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For the single-server queue with renewal input, we obtain heavy traffic 
approximations for the tume-dependent distributions of queue length and 
virtual delay by constructing approximating diffusion processes. These 
approximations are shown to agree with known limiting cases, and a com- 
parison 1s made with results from a computer simulation. 


I. INTRODUCTION 


We consider a single-server queuing system where the interarrival 
times are independent and identically distributed (i.i.d.) random 
variables, customers are served in order of arrival, the service times 
of the various customers are 1.1.d. random variables, and the inter- 
arrival and service times form independent sequences. Lindley! ob- 
tained a recursive equation for the delay-time of the nth arriving cus- 
tomer, an integral equation for the delay time of a customer in the 
steady-state, and conditions for the latter to have a nondegenerate 
limit. 

Lindley’s equations have not yielded to conveniently used analytical 
solutions, except in some special cases, stimulating a search for ap- 
proximations to the distributions of general interest. In this paper, we 
approximate the queue length and delay processes by appropriately 
chosen diffusion processes. This method of approximation appears to 
have been introduced by Gaver? and Newell.? Gaver and Newell con- 
sidered the M/a/1 queue; we extend their approximate models to the 
a1/G/1 queue. Other methods for obtaining diffusion approximations 
for queuing processes involve applying the theory of weak convergence 
to sequences of approximating processes. Whitt‘ is a survey of these 
methods and contains an extensive bibliography. We show that the 
diffusion models developed in this paper agree with the limiting 
processes obtained by weak convergence methods. 

Important features of this paper are the use of the M/M/1 queue to 
motivate a diffusion process approximation for the single-server queue 
and the use of elementary renewal theory results to obtain the param- 
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eters of the process. This approach provides an intuitive explanation 
for the limit theorems. 


i. PRELIMINARIES AND NOTATION 


Let 7; be time between the arrival epochs of the (z — 1)st and ith 
customers, 7 = 1, 2,---, assume these random variables are 1.i.d., 
and let \~! = E(T;) and oi = Var (7,1). Assume the customers are 
served in order of arrival, let S; be the service time of the zth customer, 
w= E(S8,), o% = Var (S;), and assume S,, So, --- are ii.d. random 
variables. We define the traffic intensity by p = \/y and will always 
assume p < 1. We seek approximations for the queue size and virtual 
delay at time ¢, and obtain these approximations from suitably chosen 
diffusion processes. For any function F, let F, = 0F/dz, F,, = 0F/ 
6x*, F, = OF /dy, etc., and unambiguous arguments will be suppressed. 

Let {X(t),é 2 0} be a homogeneous and additive diffusion process, 
F(t, 23%) = Pr {X(t) S x|X(O) = ao}, and a and b be the infinitesi- 
mal mean and variance of the process, respectively. Then F satisfies 
the forward Kolmogorov (Fokker-Planck) equation 


Fi = —aF. +3 Fen (1) 
with initial condition 
np) — JO fe < aol. 
F(0, x; xo) ~ : if x > veh (2) 


If the range of X(t) is [0, ©), then (1) is subject to the boundary 
condition 
F(t, 0; 20) = 0, t> 0. (3) 


The solution to (1) subject to (2) and (8) is given in Newell? as 


ieee (=3-*) seat (== 2=*), (4) 


where 
&(z) = = ew idy 
27 J—« 


is the normal distribution function (d.f.). If a < 0, then 
F(x) = lim F(a, t; 40) = 1 — e722 9)/®, (5) 


t>0 
which is the negative-exponential d.f. and is independent of 2». 
Returning to the a1/a/1 queue, let W; denote the delay of the zth 
customer. Lindley showed that when p < 1, W = lim;.... W; 1s non- 
degenerate. Kingman‘ showed that if p is slightly less than unity, the 
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d.f. of W is approximately negative-exponential with mean 
3(o%4 + o%)/(X* — we"). (6) 


lil. AN APPROXIMATION FOR THE M/M/1 QUEUE 


To motivate the diffusion model employed in approximating the 
a1/G/1 queue, and to indicate its efficacy for the M/M/1 queue in par- 
ticular, we first develop an approximation for the M/M/1 queue. A 
scheme suggested by Bailey’ for approximating stochastic processes is 
used. 

In the M/M/1 queue, customers arrive according to a Poisson process 
with rate \, and the service-time distribution is negative-exponential 
with mean po. Let N(t) denote the number of customers in the queue 
at time t, and w(t, 2; 20) = Pr {N(t) = n|N(O) = no}. For ¢ > O and 
n=1,2,---, r(t, n; No) satisfies 


oat N;No) = Ar(t,n — 13; m0) + ur(t, n + 1; no) 
— A+ p)r(t, 0; 20), (Va) 


and 
o a(t O; no) = um(t, 1; no) — A(t, 0; no). (7b) 
The initial condition is 
1 (0, n; No) : if ns 4 ’ (8) 
and the boundary condition is 
r(t,n;no) =O n <0, t= 0. (9) 


The idea of the approximation is to replace (7) by a partial differ- 
ential equation that is easier to solve. We do this by replacing the dis- 
crete variable n by the continuous variable x, and w(t,n; no) by 
p(t, 2;2%o) in (7). Expanding in a Taylor’s series about the point 
(t, x; 2%) and keeping only first- and second-order terms, we obtain 


A 
p= —-Q—wp2t*t#p..,  4t>0. (10) 


If we define 
P(t, ©; Xo) =| p(t, y; Lo)dy, 


— 


it can be easily shown that P also satisfies (10). We take 


1 if t= xX 


P(O, x; 2%) = | (11) 
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and 
P(t, 0; x0) = 0, t > 0, (12) 


as natural replacements for (8) and (9). This system of equations is 
identical in form to (1), (2), and (3), so P(t, x; xo) is given by the right- 
side of (4) witha =X —pwandb=dA+u. 
Consider now the asymptotic behavior of r and P: From the theory 
of the M/M/1 queue, we have 
Tr = lima(t,n; no) = (1 — p)p*™! for n>O 


to 


and from (5) we obtain 


P(t) = lim P(t, 2; 20) = 1 — exp L—2(1 — p)/(1 + p) J. 
If for n > 0 we approximate 7, by 


ee / " aP (8), 
n—1 


we obtain 
Tn = [1 = e~2—p) 1 +p) Je—2(n—1) (ep) / +p) 


= (1 — a)a™, n>Q, 


where a = exp [—2(1 — p)/(1 + p) J, so t, has the same form as tp. 
If p is close to unity, a = p and hence 


~ 


Tn = Tn, p = 1, 


SO 7, 18 a good approximation of 7, when p is slightly less than one. 


IV. THE GI/G/1 QUEUE—APPROXIMATE QUEUE LENGTH 


Let us first consider a heuristic manner of obtaining (10) for the 
M/mM/1 queue. During the time interval (t,¢-+ At], the number of 
customers in the system changes by the number of arrivals minus the 
number of service completions, and when N(t) = n > O, this change 
has expectation (\ — w)At + o(At) and variance (A + yu) At + o(At). 
To approximate {N(t),¢ 2 0} by a diffusion process with the same 
infinitesimal mean and variance, set a = \ — vw andb = A} + pin (1), 
which yields (10). This suggests that an appropriate choice of a and 6 
will yield a good approximation for the queue length of the e1/e/1 
queue. 

For the G1/G/1 queue, let N(¢) be the queue size at time t, A(é) and 
D(é) the number of arrivals and departures in (0, ¢ ], respectively ; then 


N(t) = N(O) + A@ — DQ, t> 0. (13) 
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For any renewal process {/(t),t = 0} where the interevent times 
have mean m and variance JV, for large values of ¢ 


EL M(t) ] & t/m, (14) 
and. 
Var [M()] 2 tV/m, (15) 
(Ref. 8). By hypothesis {A(¢), ¢ 2 0} is a renewal process, so from 
(14) and (15) we obtain 


ETA(t)] 3M, Var [A()] & N03. (16) 


The process {D(t), t = 0} is not a renewal process, but, in heavy 
traffic (p close to 1), the server will be occupied most of the time, so 
we approximate D(t) by D(t), where 


ECD (t)] & ut, Var [D(t)] && p°0?, (17) 


Cox and Smith® use (13), (16), and (17) to study the a1/a/1 queue 
for small values of ¢ without using a diffusion model. Substituting 
D(t) for D(t) in (13) and using (16) and (17), we obtain the approxi- 
mate results 


ll: 


lim E[N(t)]/t = — (18) 


and 
M3o% + u3o%, (19) 


lim Var [N (t) ]/t 


which suggest that we approximate N(t) by a diffusion process N (2), 
say, with infinitesimal mean and variance given by 


a=rv\—u (20) 
and 
b = V0% + pod, (21) 


respectively. If we let F(t, x;20) = Pr {N(t) S x|N(O) = 20}, then 
F satisfies (1), (2), (8), and hence is given by (4), with a and 6 as above. 

As a partial check on the efficacy of this approximation, let us define 
N = lim,... N(t). For p < 1, N is a proper random variable, and from 
(5), (20), and (21), 


E(N) = a + Rp me 


» ce os pot ) 


(22) 


which together with the queuing formula L = \W (see Ref. 10) and 
p = 1 yields the heavy traffic approximation given by (6). 


V. THE GI/G/1 QUEUE—APPROXIMATE VIRTUAL DELAY 


The virtual delay at time ¢t is the delay in queue a customer would 
experience if it arrived at time ¢; an exact formulation is given by 
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Benes." Toward developing an approximation of the virtual-delay 
process, define 


LQ) = 81+ S2+ +--+ + Saw; i> 0, (23) 


so L(t) represents the amount of work time brought to the server in 
(0, ¢]. Since Sy, Se, +++, Sac are ii.d., using (17) we obtain 


ELLY) = E(S1)ELAO I] pt (24) 


for large values of ¢. Using the conditional variance relationship 
Var [L(t)] = E{Var [L()|A() J} + Var {ELL()|A@ J} and (17), 
we obtain for large values of ¢, 


Var (L(t) | & A(o% + p?o% de. (25) 


The sample paths of the virtual delay process are sawtooth functions 
with a jump of size S; at the arrival epoch of the zth customer followed 
by a decline of slope —1; the process has an impenetrable boundary 
at the axis of abscissas. Assume that (24) and (25) hold for all é, so that 


a= him. {ELL(é + At)] — ELL) J]}/At = p (26) 
and 
B= im, {Var [L(é + At) ] — Var [ L(t) J} /At 


\(o% + po). (27) 


Following Gaver,? we approximate the virtual-delay process by a 
diffusion process {V(t),¢t 2 O}, say, with infinitesimal mean and 
variance given by 


=p—1, b= X(o% + p’o4), (28) 


respectively. Hence, the time-dependent d.f. of V(é) is given by (4) 
with a and b given by (28). 

Turning now to asymptotic results, when p < 1, V = lim: V(é) 
exits and is proper, and from (5) has a negative-exponential distribution 
with mean 

E(V) = 3(o% + p?o%)/(X7) — wo). (29) 


Hooke” showed that, if the interarrival times are nonlattice, then 
as p71, Pr{W Ss x}—Pr{V S x}, where V is the virtual delay in 
the steady state. This result and (29) together show that, as p Tf 1, 
E(V)/E(W) 7 1, and the d.f. of V agrees with (6). From (22) and (29) 
we observe that E(V) < E(N)/p when p <1, but E(V) 2 E(N)/z 
is known to hold. 
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We note that once a diffusion model for the virtual-delay process 
is at hand, the method given in Heyman*® for approximating the busy- 
period distribution can be applied. 


Vi. A NUMERICAL EXAMPLE 


In this section, we compare E[ N(t) | to the results of a single com- 
puter simulation. We consider a single-server queue that 1s empty at 
time zero, where the interarrival times are uniformly distributed from 
0 to 20 minutes, and the service times are uniformly distributed from 
0 to 19 minutes. Thus, 


M4= 10, of = 100/83, wit=9.5, 0% = (9.5)2/3; (30) 


hence, p = 0.95 and from (22) we obtain E(N) = 6.50. Since the d.f. 
of N is negative-exponential, the standard deviation of WN is also 6.50. 

Let N;(t) denote the sample of N(¢) produced by the jth simula- 
tion run, N; be the estimate of N produced by the jth run, and 
N= n> %_,N;. Assuming the simulator produces independent 
sample paths and approximating the mean and variance of N by those 
of N, standard sampling theory indicates that, for reasonably large 
values of n, NV has a normal distribution with mean 6.5 and standard 


deviation 6.5/Vn. Hence, given c > 0, the number of runs required to 
have 


Pr {|N — E(N)| S cH(N)} = 0.99 


is the smallest integer = (2.575)?/c; choosing c = 0.05 yields n = 134. 
These considerations lead us to simulate 150 sample paths, and indi- 
cate one of the potential uses of even a crude diffusion approximation. 
Gaver? and Newell’ observe that if one makes the change of vari- 
ables £ = —(a/b)x and r = (a?/b)t, (1) becomes 


F, = —F; + Fx/2, (31) 


which can be solved once and for all; the solutions for any a and 6 can 
be recovered by scaling. For our example, — (b/a) = 13.00 and b/a? 
= 2472.2 minutes. Table 2-2 in Gaver? gives the values of 


i: ” del? (1, £30) 


for 7 = 0.1, 0.2, ---, 1.0. These were used to construct Table I below. 

The approximations shown in the table are not as accurate as the 
diffusion approximations for M/G/1 shown in Table I of Gaver?; these 
comparisons share with Gaver’s table the property that the diffusion 
process consistently overestimates the mean queue-size. 
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Table |— Comparison of diffusion approximation 
and simulation results 


Time T 


E(N (t)] 0.1 |; 02 | 038 | 04 | 05 | 06 | 0.7 | 0.8 | 0.9 1.0 


NT ee re et Cee nll beeen 


Diffusion 2.68 | 3.50 | 4.02 | 4.40 | 4.68 | 4.91 | 5.11 | 5.27 | 5.41 | 5.52 
Simulation | 2.25 | 2.91 | 3.40 | 3.71 | 4.16 | 4.39 | 4.61 | 5.05 / 5.18 | 5.13 


Vil. COMPARISON WITH THE RESULTS OF IGLEHART AND WHITT 


Iglehart and Whitt 15 use the theory of weak convergence of 
stochastic processes to obtain heavy-traffic-limit theorems for large 
classes of queuing processes. In particular, these theorems hold for the 
c1/G/1 queue. Their results are obtained by considering a sequence of 
a1/G/1 queuing systems, indexed by n (n = 1,2, ---). For the nth 
system, let \, and u, denote the arrival and service rates, respectively, 
and N,,(t) and V,(¢) denote the queue size and virtual delay at time 
t, respectively. For the above quantities, let the absence of a subscript 
denote the limit with respect to n, e.g., \ = limnsaoAn. For each n, let 
o*, and o% be as before, and assume NV,(0) = V,(0) = 0. 

Section three of Iglehart and Whitt!* shows that if 


limnso (An — fn) Vn = c, where c is some finite constant, then 


tim ee =>B(,—¢/y), (32) 


where y = \°¥o% + u'c%, Bt’, —c/y) is the Wiener process with nega- 
tive drift c/y together with an impenetrable barrier at the origin and 
= denotes weak convergence. From (82) it follows that for large 
values of n, 1.e., when pn = An/pn 1S close to |, 


Pr {Nn(nt’)/yvn S d} 


Since N,(t) = N(t), c = (A — p)Vn for large n, upon making the 
change of variables x = dyvn and t = nt’, we obtain 


Pr {N(t) Ss x} 


a (2 - a nr) _ g2Q—n) alg (=) (34) 


from (33). Observe that the right-hand side of (34) agrees exactly with 
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the time-dependent distribution (when x) = 0) of N(é) obtained in 
Section IV. 

Let an = pnt — rat and o = o4 + 0%. From theorem 6.1 of 
Iglehart and Whitt,!° it is possible to derive the following result, which 
appears in theorem 4 of Whitt.!® If ano vn => —kj -wx» <k< oe, 
then 

/ 
Nigh) sgl iy) (35) 


TVNAn 


For large values of n, an = (op — 1), —k = (p — 1)d7o7!Wa, 
Vt) = V(t), and 
Pr {V (nt!) S davVyn} 


i+ ksh) 7 Gie25 
= @( |) — eg tkdvag ( —— |. (86 
(a ; W en) 


Letting ¢ = nt’ and x = devyn in (35), we obtain 


Pr {V(@) S 2} 


(Se) loz) dag (4 ): (37) 


The right-hand side of (36) represents the time-dependent distribu- 
tion (4) with x») = 0 and 


a=p—I, b=X(o4 + 95), (38) 


— 
—w= 


1 


which is the same value of a and almost the same value of 6 given in 
(28), where the difference between the values of 6 given in (28) and 
(88) vanishes as p —> 1. 

From the results of this section, we can conclude that the heuristically 
constructed diffusion models for N(¢) and V(é) given in this paper can 
be regarded as the limit (in the sense of weak convergence) of N (#) 
and V(t), respectively, with suitable normalization. 


VIII. SUMMARY 


We have obtained an approximation for the time-dependent dis- 
tributions of queue length and virtual delay in a a1/a/1 queue using 
a diffusion model. The diffusion model for the queue length process 
was obtained by using the mean and variance of the asymptotic rate 
of change of an approximation of the queue-length process as the 
infinitesimal mean and variance for a diffusion process. The diffusion 
approximation was compared to a simulation of a particular queueing 
process, and reasonably close agreement for mean queue lengths was 
obtained. The approximation for the virtual-delay process was gen- 
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erated In a manner suggested in Gaver? and the limiting results it 
provided were shown to agree with a theorem of Kingman® for the 
delay process. The time-dependent distributions for the approximate 
queue length and virtual-delay processes agree, as the traffic intensity 
approaches one, with the limiting results of Iglehart and Whitt.'445 
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Optimal Rearrangeable Graphs 
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Many important properties of switching networks can be effectively 
studied in the more general context of graph theory. In particular, the 
various rearrangeability properties of a network fall into this category. 
If G ts a graph with verter set V = IU Q, we say G ts rearrangeable 
af, for all choices of distinct vertices, 11, t2, +++, %,7n LT and Ji, Jo, +++, Jem 
Q, there exist vertex disjoint paths between 1, and jx for all k. In this 
paper, we determine the minimum number of edges any rearrangeable 
graph may have for all choices of I and Q. We also discuss generalizations 
an which V ts strictly greater than I UU Q and/or t is bounded by a pre- 
determined value. The minimal rearrangeable graphs we construct can 
be used to form efficient rearrangeable (and nearly rearrangeable) switching 
networks of arbitrary size. 


I. INTRODUCTION 


Let G be a finite graph with vertex set V(G) and edge set H(G).* 
Let J and Q be nonempty subsets of V(G) (not necessarily disjoint) 
and let S denote the set 


ViQ\ZUQ) = {fv € VG |[vEL U Q}. 
We use the following terminology: 


(1) A request is an ordered pair (2, y) with « EC J, y EQ, and 

LF Y. 

(21) A set of requests is called an assignment if each vertex in the 
set occurs once at most. 

(272) An assignment A is called realizable in G (or we say G satisfies 
A) if we can find a set of vertex-disjoint paths connecting x 
and y for each pair (xz, y) in A. 

(2v) A graph G is said to be rearrangeable if G satisfies any assign- 
ment. 


The problem we consider, first suggested by F. K. Hwang,? is to find 


“Le., £(G@) consists of a prescribed set of unordered pairs of distinct elements of 
some finite set V(G). Generally, we follow the terminology of Harary.} 
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rearrangeable graphs with given J and with Q having the least possible 
number of edges. 

In this paper, we derive lower bounds on the minimum number of 
edges that a rearrangeable graph can have (see Theorem 1 in Section 
II). In addition, we also construct rearrangeable graphs which meet 
these bounds so that these graphs are optimal by this measure. Finally, 
we consider a generalization of rearrangeability, called k-rearrange- 
ability, and we solve the corresponding problems in this case as well. 

This study was motivated by questions of rearrangeability in 
switching networks (see Ref. 3). The sets J and 2 correspond to the 
sets of inlets and outlets, respectively ; an edge {x, y} of G corresponds 
to a crosspoint between xz and y. The optimal rearrangeable graphs we 
construct can consequently be used to form efficient rearrangeable 
(and nearly rearrangeable) switching networks of arbitrary size. 


Il. BASIC PROPERTIES OF REARRANGEABLE GRAPHS 


Let G be a rearrangeable graph with distinguished subsets J and Q, 
where we assume without loss of generality that |I| = n S$ m= |Q|. 
For the bulk of the paper, we shall restrict ourselves to the special case 
that S is empty, ie., V(G) = IU Q. 

If {x, y} is an edge of G, we say that x and y are adjacent and we write 
x~y. Similarly, for T C V(G), the notation « ~ T' will denote that 
x ~t for some ¢t € T. By the degree of v € V(G), written deg (v), we 
mean the number of edges of G containing v. More generally, if 
X CV(G), then degx (v) denotes 


[{{v, e}| {v, 2} © H(G) and x € X}|. 


Suppose there is a vertex v C I with deg (v) = k <n, and let 
V1, °°, ¥; denote the vertices that are adjacent to v. 

Now consider an assignment A in which all the v;, 171s hk, 
occur as well as the pair (v, v’), where v’ € Q is not adjacent to v. But 
this A is not realizable in G, which contradicts the hypothesis that G 
is rearrangeable. Hence, for all v €C J, we must have deg(v) 2 n. By 
a similar argument, it can be shown that deg (v’) 2 n for all vo’ EQ. 
Thus, for any rearrangeable graph G we must have: 


Fact 1: For all v € V(G), deg (v) = n. 


Let us now state several more elementary facts about rearrangeable 
graphs G which can be proved in much the same way as Fact 1. 


Fact 2: For all v € I, dege (v) 2 n. 
Fact 3: For all v © V(G), max [degr (v), dege (v) |] 2 n. 
Fact 4: lIfv~ TI andv~Q and |If)Q| = 0, then deg (v) 2n +1. 
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Lemma 1: In a rearrangeable graph G with vertex set V(G) = ITU Q, 
|Z] =n<m= {Q|, and |I()Q| =0, there are at least n vertices 
with degree greater than n. 


Proof: If allv € J satisfy dege (v) 2 n + 1, then we are done. Suppose 
there is an element v € I satisfy degg (v) = n, say, v 18 adjacent to 
V1, V2, -*+, Un. If any v;, say, 1, is not adjacent to Q, let us consider an 
assignment in which all the v, 2572 n, occur as well as the 
ordered pair (v, v’), where wv’ is a vertex in 2 different from any 2. 


However, since |Z| = n, it is impossible for G to satisfy this assign- 
ment. Hence, all the v; are adjacent to both J and 2. By Fact 4, deg »; 
=n-+1forz = 1, ---, n, which proves Lemma 1. 


Lemma 2: If G is any rearrangeable graph with vertex set I \U Q, which 
has |Q| > |I| = n and |I()Q| = 0, then G has at least n(p + 1)/2 
edges, where p = |V(G)|. 


Proof: The number of edges in G satisfies the following inequality: 


|B(@)| = e(@) = 4 & deg ©) 
a[(p — n)n + (n+ In] 


Il [IV 


4n(p + 1). 


Lemma 8: In a rearrangeable graph G with vertex set V(G) = IU Q, 
which has |I| =n < m= |Q| < 2n and |I (1) Q| = 0, we have 


e(G) = nm — 4 (m — n—I1) (mM — n). 


Proof: Denote the vertices in I by 21, ---, tn. Suppose the vertex 7; 1s 
adjacent to d; vertices in 2, where we may assume d; S dz S --: S dn. 
Let 2; be the union of {2;} and the d; elements in Q, which are adjacent 
to 7; By Fact 3, each element in 2 \_Q; is then adjacent to at least 
n—(m-—d;) +1 elements in Q;. Hence, by counting the total 
number of edges e(G) and using the fact that d, S d; for all z, it follows 
that 


e(G) = ndi + (m — di) (n — m+ di + 1) + 4(m — di) (m — di — 1). 


But the right-hand side is minimized by choosing d; as small as possible. 
Thus, since m < 2n, then by Fact 3, we have 


e(G) = mn — 4(m — n)(m — n — 1), 


which proves the lemma. 
The preceding inequalities are summarized in the following result. 


Theorem 1: In a rearrangeable graph with vertex set V(G) = IU Q, and 
lI} =nsm= |Q|, |Q NK TI| = 0, the number of edges e(G) satisfies 
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nim +n-+ 1) 
e(G) = a ym & 2n, 


[mn — 4(m — n)(m —n—1)] tf 2n>m2n, 


where [x] denotes the smallest integer which is greater than or equal to x. 
The proof follows at once from Lemma 2 and Lemma 3. 


lll. OPTIMAL REARRANGEABLE GRAPHS—MANHATTAN GRAPHS 


In this section, we give a construction for a class of optimal re- 
arrangeable graphs. The number of edges in these graphs will meet 
the lower bound in Theorem 1. These graphs will be called Manhattan 
graphs because they resemble a number of bridges connecting a high- 
density metropolitan area and low-density suburban areas. 

A Manhattan graph with vertex set V(G) = J U © will be denoted 
by MCU, Q). If |Z| =n, |Q| = m, and |Z (1) Q| = 0, MU, Q) is also 
denoted by M(n, m). 

In this section, we give the construction of M(n, m) for any n and 
m by considering the following cases. 


Case 1, n = m: The Manhattan graph M(n, n) is the complete bi- 
partite graph K,.,n, 1.e., there is an edge between every pair of vertices 


(u,v)jvElIvEad. 


Case 2,n <m < 2n: We shall specify the edges of M(n, m) by giving 
the subgraph spanned by various subsets of vertices of M(n, m). The 
spanning subgraph of a set S C V(G) is the subgraph of G with edge 
set {{x, y}| {x,y} © H(G) and z, y € S}. 
Let 
[T= {74, 12, res, tats 
Q = {X1, Lay ty Uny Yrs Y2, °° 5 yc}, 
where 
t=m—n. 
M (n, m) will be constructed as follows: 
(<) The spanning subgraph of the vertices J U {a:{|1 Si Sn} 
in M(n, m) is a complete bipartite graph Kn,,; 
(22) y; is adjacent to 2;, Uji, °°, Liecn_n), for 7 = 1, 2, +++, 0; 
(77t) The spanning subgraph of the vertices {y1, yz, -°-, ye} in 
M (n,m) is a complete graph K;,. 
The graph M(n, m) is clearly rearrangeable. As an example of this 
construction, we illustrate M(3, 5) in Fig. 1. 


Case 8, 2n < m < 3n: The construction scheme for M(n, m) in this 
case may be described as follows: 
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Fig. 1—Graph M (3, 5). 


Let 
[= {721, 2, rs 
Q = {Xa, B27 °* %y Bny Yr, Y2y + °°, Yny Hy &2, See, 


where 
t= m — 2n. 


(1) The spanning subgraph of the vertices J U {21, x2, +++, tn} 
in M (n, m) is a complete bipartite graph K.,,,; 

(12) x; is adjacent to y; forj = 1, 2, ---, n; 

(272) 2; 18 adjacent to y1, yo, -++, yn forj = 1, 2, ---, t; 

(zv) The spanning subgraph of vertices {y1, yz, --+, yn} in M(n, m) 
is any graph with degree sequence 


Bare Wet a SS Tw 
re Ee ae Ds ee ee EE 
n — 1 times, 


where 
_|jn—t—1 ifn(m —t + 1) 1s even, 
~ In —#~ otherwise. 


From a well-known theorem of Erdés and Gallai,* a graph with this 
degree sequence can always be constructed. The graphs M (38, 7) and 
M (8, 8) are shown in Fig. 2 as examples of this construction. 

We want to show this graph is rearrangeable. Given an assignment 


A involving vertices Lay, Lag) °° * 5) Lanys Yor» Yoo * °° Ybnoy Scr» Sea) °°) Feng) 
it is clear that n = ni + ne + nz. 
? . 
We may assume nz = 2 + ng, where both 2; and y,, 7 = 1, 2, ---, 


Nz, appear in A. 
If t — nz3 = ny, it is easy to see this graph is rearrangeable. 


Suppose ¢ — n3 < ng. Let us consider that the set S; = {yo,{1 
<7 na, y; is adjacent to y,, and both ys; and x, do not occur in A}. 
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(b) 
Fig. 2—(a) Graph M (8, 7). (b) Graph M (3, 8). 


Since |S;| 2n—t— m1 — nz for all 1 Sj Sn, we know that at 


least n —t— ns — ng of the no requests involving Yyo,, Yb ***) Yoo 
can be connected. 
If 
mg 2n—-t—m— mM, 
then 
Ng —- (n—-t— 1m — mM) =t— (n— m1 — nm — Nz) 
< t— 3. 


After the remaining nz — (n — t — m1 — ng) requests are connected 
by a path passing through some of the ¢ — n3 z,’s, which do not occur 
in A, the requests involving the 2p,’s or the 2,,’s can be easily connected. 
Thus, we have proved that the graph M (n, m) is rearrangeable. 

For the case m = 3n — 1, there is another type of Manhattan 
graph which is a special case of the following class of graphs. 

Case 4,m = h(n —1) + 2n,h 21: 
Let 
[= {t1, te, reas tnt, 
Q = {X1, Lay e+, Lay Yt, Yay 1) Yny 21y 22y 0) S(n-ayn he 


The graph M(n, m) is constructed as follows: 
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(1) The spanning subgraph of vertices J U {x:|1 S 7 S n} is the 
complete bipartite graph K,.»; 
(12) x; 1s adjacent to y;,7 = 1, 2, ---, n; 
(412) There is a cycle with vertices y1, 21, 22, ---, 2h) Ya, hey ° °°, Sh; 
Y3, °° °) Yn, V1; 
(iv) There is a complete graph K,»_1 with vertex set {2::.]7 = 0, 
1,---,n — 2} fori = 1, 2,---,h. 
(v) yi 1s adjacent tO Y1, Ya, ++ °, Yin, Yire, + 7%) Yn for 2 = ay 
Ne 
y1 is adjacent to Ys, Y4, ++) Yn—1- 
Yn 1S adjacent to Yo, Y3, °+*, Yn—2 


As an example of this construction, we illustrate (7 (4, 14) in Fig. 38. 

To see that this graph is rearrangeable, let us consider an assignment 
in which Hayy Vag °° *y Vanyy Yor Yoo) ** 'y Ydnos and @eyy “cay °° *y Sen, OCCUY. 
Because of the structure of this graph, any request involving the z,; or 
yp, can easily be connected after the n3 requests involving the 2,,’s are 
connected. 

It is clear that n 2 ni + ne + n3. First, let us consider the special 
case Ny = ne = 0. Let P;,;,7 < 7, denote the path y;, 2¢:-1ynga, 2G—1n+2, 

+) 2iny Yirt, «°°, Y; Tf each of Pio, Po,3, +--+, Pn—1,n contains one of 
the z,,; except for one P;,::1, then the assignment can be satisfied. If 
more than one of P1,2, P23, ---, Pn—1,n contains more than one 2,,’s, 
say P1,2 contains 2,,, 2, and P3,4 contains 2,3, 2-,, we know that at least 
one of the P;,:41’s does not contain any 2,,, say P:t41. Instead of 
considering the assignment A, it suffices to consider the assignment 
involving {24¢-1)n} U {2e,|¢ = 2, 8, ---, n3}. Continuing this argu- 
ment, it is enough to consider an assignment satisfying the property 
that all the z,, involved appear in distinct P;,;,1’s except for two of 
them and, therefore, this assignment is realizable. 

Now, for arbitrary ni and no, let S = {a1, de, +++, An,, 61, be, «++, Ong}. 
Relabel S by S = {s1 < 82 < +--+ < Sn}, nm’ S ni + me, and consider 





Fig. 3—Graph M (4, 14). 
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the set T = {P.,|¢ <j,4,j ES anditi,it+2,--,7-1ES}. 
Then |7T| =n — |S8| —1. 
Let 


k= {z:|2 =¢1, C2, °° Cus U ys|9 <= {bi, bo, an -, Dna} \{ ar, i +) On} }. 


If all paths in 7 contain one element of R except one path which 
contains two elements of R, then the assignment is clearly realizable. 
Otherwise, we may use an argument similar to the one above to 
establish the rearrangeability of M (n, m). 


Case 5,m 2 8n and m = 2n+h(n— 1) +t,0 <t <n — 1: In this 
case, the graph is a combination of a graph of Case 3 and a graph of 
Case 4 except for minor modifications. Let 


a = {71, 19, Tepe tn}, 
Q = (2, U2, °° 4, Lny Yr, Y2, ° °°) Yny %1) ©2y °°, &(n—-1)hy W1, Wa, °°; Wr}. 
(1) If t #4 n — 2, let us delete all the edges of the form {y;, y;}, 
171,75 nin M(n, 2n +2) in Case 3. We then construct a 
cycle of vertices y1, 21, 22, °° +, 2n—1, Ya, °° *) Yn; Y1. Lhe spanning 
subgraph of vertices {21, 22, +++, 2cn_1yn} are the same as that 
in M(n, 2n + (n — 1)h) in Case 4. The spanning subgraph 





S 
ZN . ; 
eZ 
(b) 
Fig. 4—(a) Graph M (5, 16). (b) Graph M (5, 17). 
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of vertices {y1, Y2, -°*, Yn} with the exception of the edge yiyn, 
is any graph with degree sequence 


(n—-—t—3,n—t—3,---,n—t—3,w) 
SS EEEEEEEEEEEEEEEE nS EEESEEEnnEEEIEnenee meee Vee ee 
n — 1 times, 
where 
n—t—3 if n(n —¢t — 8) is even, 
n—t—2 otherwise, 
and 
Yi Yin, t = 1,2, ---,n —1. 
(zz) If t = n — 2, the construction scheme is as follows: 


(a) The spanning subgraph of vertices J U {21, 22, -++, Ln} 


1s: Mae: 
(b) x; 1s adjacent to y;,7 = 1, 2, ---, 7; 
(c) There is a path yi, 21, +++, 2, Yo 2epay +7; Zany Yay 17 Yor} 
(d) There is a complete graph k,_1 with vertex set {z::;,|7 = 0, 
1,---,n — 2} forj = 1, 2, ---, A. 


(e) When n = 8, wi 1S connected to any y;. If n # 3, we have 
the following: 


w;, t= 1,2, ---,n — 2, 1s adjacent to all y; except yisi; 
If nis even, then wa-1 ~ wai, t = 1, 2, ---, [n/2] — 1. 
If n is odd, then Wai—-1 “™ Wai, 1= l, 2, oe [n/2 | ars 1, 


Wn—3 ™ Wn—2- 


It is an easy exercise to show that the Manhattan graph M(n, m) 
thus constructed is rearrangeable. As examples of this construction, 
we illustrate 1/7 (5, 16), (5, 17) in Fig. 4. 

By a direct calculation, it is easy to verify that all the Manhattan 
graphs we constructed in Cases 1 through 5 achieve the lower bounds 
on all rearrangeable graphs for given J and Q with |Z (1) Q| = 0. From 
this, the following result is immediate. 


Theorem 2: The Manhattan graphs M(n,m) are optimal rearrangeable 
graphs. 


We note that a complete bipartite graph Kn» has nm edges. Thus, 
by Theorem 1, a Manhattan graph has precisely [$(m — n — 1) 
xX min (n, m — n) | fewer edges than K,,m. When m is large compared 
to n, this is approximately 3nm. 


IV. MANHATTAN GRAPHS FOR THE CASE OF |/()2| 40 


Let us now consider an optimal rearrangeable graph with vertex 
set 1 UQand |I1) Q| #0. 
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If {Q\Z| 2 |Z|, then the Manhattan graph M (J, 2) will be taken 
to be the same as M(I, Q\J). To prove that the Manhattan graph 
MI, Q) is an optimal rearrangeable graph for given J, Q, and |Q\1| 
>= |I|, we need only show that M(J, Q) is rearrangeable. Any re- 
quest (2, y) €C A, x EI, y € QNS, can be connected in M(I, QNJ) 
as well as in M(J, Q). If the assignment contains some request (2, y), 
where both x and y are in J, they can be successfully joined by a path 
of length 2 via some vertex in Q\I. 

When |Q\J| s |Z| — 1, the following construction suggested by 
F. K. Hwang? suffices for M(I,Q). © 

Let M(/,2) be the union of a complete graph K, and a three- 
partite graph Kim. aS shown in Fig. 5b, where n = |J\QI], 
m= |O\I|, l= |QQ]|. To illustrate this construction more 
clearly, we denote the graph J, to be the graph of n vertices without 
any edge. If two graphs G and H are joined by two thick lines, as 
shown in Fig. 5a, there is an edge connecting any vertex in G to any 
vertex in A. 

We note that no edge in the above graph can be deleted without 
destroying the rearrangeability of the graph for the given J, 2. Thus, 
we can state the following result. 


Theorem 8: The Manhattan graph MCI, Q) 1s an optimal rearrangeable 
graph for any given I, Q. 


V. ke-REARRANGEABLE GRAPHS 


A graph is said to be rearrangeable of capacity k or k-rearrangeable if 
it satisfies any assignment A of size at most k, 1e., 


A = { (X1, Y1); met, (Xt, yi) }, tsk 


A rearrangeable graph is easily seen to be a special case of a k-re- 


arrangeable graph with k = [J]. 


(b) 


Fig. 5—(a) Complete connection between two graphs. (b) Graph M(/,Q) with 
n= [iXQ|,m = [9X7], 0 = [2N TI, 
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Assuming |Q| 2 |Z| 2 k, a k-rearrangeable graph G with vertex 
set V(G) = J U 2 has the following properties (which are similar to 
those of a rearrangeable graph). 


Fact 1’: For all v € V(G), deg (v) 2 k. 

Fact 2’: If v € I, dege (v) = k. 

Fact 3’: For any v € V(G), max [deg (v), dege (v) | = k. 
Fact 4’: lf vu ~ I,v ~Q, then deg (v) 2k + 1. 


Lemma 1’: The number of edges in a k-rearrangeable graph with vertex set 
VG) =I1UQ, |I| >k, |Q\I| > k, p = |V(Q)|, satisfies 


o(@) = | Het). 


Proof: If there is an x € J which is not adjacent to J, then the spanning 
subgraph G’ of the vertex set V(G)\ {v} in G must be k-rearrangeable. 
If |J| =k +1, then G’ has at least 4kp edges. If |I| > & + 1, then 
G’ has more than $kp edges (by induction). In any case, G has at 
least 4k(p + 2) edges. Similarly, G must have 23k(p + 2) edges if 
there is an y € Q which is not adjacent to 2. 

If all vertices in J are adjacent to J and all vertices in 2 are adjacent 
to 2, consider the sets 


= {tElli«r~a,zr~ Tt, 
= tyEQly~ Ly ~ Qh. 


| 


Any element in J’ or has degree =k +1 and also [J/’| 2 k, 


|Q’| 2 k. Hence, 


Similar to Theorem 1, we have Theorem 4. 


Theorem 4: The number of edges in a k-rearrangeable graph with vertex 
st V(G)=I1UQ, |I| =n, |Q) =m k<nsm, [INQ] =90, 
satisfies 


rak(m +n+2)] of 2kSnem, 
[s{k(m+n+1) +tk-t4+ 1} 1 
e(G) = wk<n<2k sm, n=k +t, 
[skim +n) +tk -t+)4+/k-t+)}1 
uk<nSim<2k, n=k+t m=k+t. 


If J and © are disjoint, an optimal k-rearrangeable graph can be 
constructed by combining two optimal rearrangeable graphs M(k, n), 
M(k,m) by overlapping K;,, as shown in Fig. 6. These are called 
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Fig. 6—Graph M,(14,16). 


Manhattan k-graphs and are denoted by M;(n, m), where n = |J|, 
m= |Q|. 

Because M (k, n), M(k, m) are rearrangeable, the k-rearrangeability 
of M,(n, m) follows immediately. 

If |QA) 7] =1, then M,(/,Q) is the same as M,(|JZ|, |Q|) — 1). 
If |Q9A I) =2 and |QAI| 2&, then M,(7,Q) is M,({I| — 1, 
}Q} — 1). 

We notice that M,(n,m) = M(n, m). 


Theorem 5: Manhattan k-graphs are optimal rearrangeable graphs of 
capacity k for given I, Q where |Q| = |I| >k, |QQ TI| S 2, |Q\I| =k. 


As we noted earlier, Manhattan graphs have considerably fewer 
edges than the corresponding complete bipartite graphs with the 
same vertex sets. This is also the case for Manhattan k-graphs as well. 
In particular, the number of edges saved is 


[mn — ${k(m + n) + max [k, (n — k)(2k —n + 1)] 
+ max [k, (m — k)(2k — m + 1) ]} ]. 


When |Q()/| is large, alternate constructions of k-rearrangeable 
graphs for given J and Q can be given by adding k additional vertices, 


\ Mu(k, IQ\IT) 
‘\ 4 


rt ae 


—— 


Fig. 7—k-rearrangeable graph with Steiner vertices. 
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called Sterner vertices. For example, we may consider the following 
graph with vertex set J\UQ2US and |S| =k as shown in Fig. 7. 


(2) The spanning subgraph of J and S is a complete bipartite 
graph Ky x. 

(it) The spanning subgraph of S and Q\/ is precisely the Man- 
hattan graph M,(k, |Q\J]). 


This graph is clearly k-rearrangeable. 


VI. GRAPH REPRESENTATIONS OF A SWITCHING NETWORK 


Consider a graph G with vertex set V(@). Let J and 2 be nonempty 
subsets of V(G) and let S = V(G@)\.UU Q), which we shall call the 
Steiner set of G. 

The graph G corresponds to a switching network in the following 
way: 


(2) I < inlet lines. 

(22) Q < outlet lines. 
(4722) Edge {z, y} <> a crosspoint between x and y. 
(zv) S © additional lines. 


For example, the rectangle network in Fig. 8 corresponds to the 
complete bipartite graph K3,4. 

The Manhattan graph (3,5) of Fig. 1 corresponds to the re- 
arrangeable network shown in Fig. 9. | 

An example of a network derived from the graph in Fig. 10a with 
a nontrivial Steiner set is shown in Fig. 10b. 

In this way, a switching network can be represented by a graph. 
A rearrangeable graph then corresponds to a rearrangeable network. 
A k-rearrangeable graph corresponds to a rearrangeable network of 
capacity k. 

Many problems in switching networks can in this way be viewed as 
graph-theoretic problems. Instead of minimizing the number of 





7 7S THE DIAGONAL LINE SEGMENT 
wee AT THE INTERSECTIONS OF 

= TWO “LINES” (= VERTICES) 

REPRESENTS A CONNECTION 

(OR CROSSPOINT) BETWEEN 

THESE LINES. 


Fig. 8—Rectangle network of size 3 X 4. 
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Fig. 9—Rearrangeable network corresponding to graph M (38, 5). 


crosspoints to reduce the cost of building a network, we consider the 
problem of finding a graph with the least possible number of edges. 
The size of S in the graph representation of a switching network 
determines how many lines we have to use in addition to the inlet 
and outlet lines. The Manhattan graph we have constructed then 
provides a model of a rearrangeable network with a minimum number 
of crosspoints for the case that the size of S is 0. 


Vil. CONCLUDING REMARKS 


Almost all previous results on rearrangeable networks dealt with 
rearrangeable graphs having |J| = |Q| and |Z ()Q| = 0. Benes? 


(a) (b) 


Fig. 10—(a) Graph with J = {1, 2,3}, and Q = {4, 5,6, 7}, and S = {8, 9, 10}. 
(b) Network corresponding to (a). 
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has shown that a rearrangeable network for |ZJ| = |Q| = n can be 
constructed with slightly more than O(n log 7) crosspoints, which is 
just the information-theoretic lower bound. However, |S]| is required 
to be arbitrarily large to approach the O(n log n) bound. This result 
was later refined by Waksman and Joel.® 

When just one middle stage is allowed, Preparata’ gave a lower 
bound on the number of crosspoints in a k-rearrangeable network and 
showed several optimal designs for arbitrary sizes of J and Q. 

With regard to nonblocking networks, we can define nonblocking 
graphs as those which satisfy the following property : The vertex set of a 
nonblocking graph G is V(G) = JU QUS, where S is disjoint from 
I and ©. For any assignment A = {(2;, yi)|t = 1, 2, ---, }, we can 
find a path connecting zx; and y; without disturbing the existing paths 
already connecting xz; and y;, 1 $7 <7. In other words, there is 
always a path connecting x; and y; whose vertices and edges are 
disjoint from those of the previous paths. 

If the vertex set of a nonblocking network is the union of J and Q, 
one class of nonblocking graphs we can construct is formed from the 
union of a three-partite graph Ky. ,. and a complete graph K,, where 
[I 2) =1, |Z| = n, and |Q| = m, as shown in Fig. 5. 

Bassalygo and Pinsker® have shown by a nonconstructive argument 
that there exist nonblocking networks with O(n log n) crosspoints, 
where |J| = |Q| =n and the size of S approaches infinity. The 
best known construction, due to Cantor,® requires O[n(log n)? | 
crosspoints. 
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